Archives AI News

Finding Culture-Sensitive Neurons in Vision-Language Models

arXiv:2510.24942v2 Announce Type: replace Abstract: Despite their impressive performance, vision-language models (VLMs) still struggle on culturally situated inputs. To understand how VLMs process culturally grounded information, we study the presence of culture-sensitive neurons, i.e., neurons whose activations show preferential sensitivity…
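The abstract defines culture-sensitive neurons as neurons whose activations show preferential sensitivity to culturally grounded inputs. The paper's actual detection method is not given in this snippet; a minimal illustrative probe, assuming access to per-neuron activations on culture-tagged versus neutral inputs, could simply score each neuron by its standardized mean-activation difference:

```python
import numpy as np

def culture_sensitivity_scores(acts_cultural, acts_neutral):
    """Score each neuron by how much its mean activation differs
    between culturally grounded and neutral inputs.

    acts_cultural, acts_neutral: (num_inputs, num_neurons) arrays.
    Returns a (num_neurons,) vector; a higher score means the neuron
    responds preferentially to the cultural inputs under this probe.
    """
    mean_c = acts_cultural.mean(axis=0)
    mean_n = acts_neutral.mean(axis=0)
    # Pooled std makes the score scale-free across neurons.
    pooled_std = np.sqrt(
        (acts_cultural.var(axis=0) + acts_neutral.var(axis=0)) / 2
    ) + 1e-8
    return np.abs(mean_c - mean_n) / pooled_std

# Synthetic activations: neuron 2 is built to respond to "cultural" inputs.
rng = np.random.default_rng(0)
neutral = rng.normal(0.0, 1.0, size=(200, 4))
cultural = rng.normal(0.0, 1.0, size=(200, 4))
cultural[:, 2] += 3.0  # inject a preferential response
scores = culture_sensitivity_scores(cultural, neutral)
print(int(scores.argmax()))  # neuron 2 scores highest
```

All names here are hypothetical; real analyses would also control for confounds (language, image style) rather than rely on a raw mean difference.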

Dimensional Criticality at Grokking Across MLPs and Transformers

arXiv:2604.16431v1 Announce Type: new Abstract: Abrupt transitions between distinct dynamical regimes are a hallmark of complex systems. Grokking in deep neural networks provides a striking example — an abrupt transition from memorization to generalization long after training accuracy saturates —…

Rate-Distortion Optimization for Transformer Inference

arXiv:2601.22002v3 Announce Type: replace Abstract: Transformers achieve superior performance on many tasks, but impose heavy compute and memory requirements during inference. This inference can be made more efficient by partitioning the process across multiple devices, which, in turn, requires compressing…
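The snippet says partitioned inference requires compressing the intermediate activations exchanged between devices. The paper's rate-distortion scheme is not described here; as a generic stand-in, a uniform scalar quantizer shows the basic rate/distortion trade: fewer bits per value means a coarser reconstruction, with worst-case error of half a quantization step.

```python
import numpy as np

def quantize_dequantize(x, bits=4):
    """Uniform scalar quantization of an activation tensor, a generic
    placeholder for compressing intermediates sent between devices
    (not the paper's actual method).
    """
    levels = 2 ** bits - 1
    lo, hi = float(x.min()), float(x.max())
    scale = (hi - lo) / levels if hi > lo else 1.0
    q = np.round((x - lo) / scale)   # integer codes: the "rate" side
    return q * scale + lo            # reconstruction: the "distortion" side

rng = np.random.default_rng(1)
acts = rng.normal(size=(8, 16)).astype(np.float32)
recon = quantize_dequantize(acts, bits=4)
max_err = float(np.abs(acts - recon).max())
# max_err is bounded by scale / 2, i.e. (hi - lo) / (2 * (2**bits - 1))
```

Raising `bits` shrinks the error bound at the cost of more transmitted data, which is the trade-off a rate-distortion optimizer would tune per layer.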

Instance-Adaptive Parametrization for Amortized Variational Inference

arXiv:2604.06796v2 Announce Type: replace Abstract: Variational autoencoders (VAEs) rely on amortized variational inference to enable efficient posterior approximation, but this efficiency comes at the cost of a shared parametrization, giving rise to the amortization gap. We propose the instance-adaptive variational…
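The amortization gap mentioned here arises because one shared encoder must serve every input, so its posterior guess can be off for any particular instance. A standard way to see this (a toy sketch, not the paper's proposal) is per-instance refinement: in the conjugate model z ~ N(0,1), x|z ~ N(z,1) with q(z|x) = N(mu, 0.5), the ELBO gradient in mu is (x - mu) - mu = x - 2*mu, so a few gradient steps move an imperfect amortized mean to the exact posterior mean x/2:

```python
def refine_mu(x, mu_init, steps=50, lr=0.1):
    """Per-instance gradient ascent on the ELBO mean parameter for the
    toy model z ~ N(0,1), x|z ~ N(z,1), q(z|x) = N(mu, 0.5).
    The closed-form ELBO gradient w.r.t. mu is x - 2*mu, whose fixed
    point mu = x/2 is the exact posterior mean.
    """
    mu = mu_init
    for _ in range(steps):
        mu += lr * (x - 2.0 * mu)  # closed-form ELBO gradient step
    return mu

x = 2.0
mu_amortized = 0.4 * x              # an imperfect shared encoder: 0.8
mu_refined = refine_mu(x, mu_amortized)
print(round(mu_refined, 3))         # converges to x/2 = 1.0
```

The gap closed here per instance is exactly what a richer, instance-adaptive parametrization aims to remove without paying for per-instance optimization at test time.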

Non-Stationarity in the Embedding Space of Time Series Foundation Models

arXiv:2604.16428v1 Announce Type: new Abstract: Time series foundation models (TSFMs) are widely used as generic feature extractors, yet the notion of non-stationarity in their embedding spaces remains poorly understood. Recent work often conflates non-stationarity with distribution shift, blurring distinctions fundamental…