Archives AI News

An MLCommons Scientific Benchmarks Ontology

arXiv:2511.05614v1 Announce Type: new Abstract: Scientific machine learning research spans diverse domains and data modalities, yet existing benchmark efforts remain siloed and lack standardization. This makes novel and transformative applications of machine learning to critical scientific use-cases more fragmented and…

Language Generation with Infinite Contamination

arXiv:2511.07417v1 Announce Type: cross Abstract: We study language generation in the limit, where an algorithm observes an adversarial enumeration of strings from an unknown target language $K$ and must eventually generate new, unseen strings from $K$. Kleinberg and Mullainathan [KM24]…

Frequency Matters: When Time Series Foundation Models Fail Under Spectral Shift

arXiv:2511.05619v1 Announce Type: new Abstract: Time series foundation models (TSFMs) have shown strong results on public benchmarks, prompting comparisons to a “BERT moment” for time series. Their effectiveness in industrial settings, however, remains uncertain. We examine why TSFMs often struggle…

Fooling Algorithms in Non-Stationary Bandits using Belief Inertia

arXiv:2511.05620v1 Announce Type: new Abstract: We study the problem of worst case regret in piecewise stationary multi armed bandits. While the minimax theory for stationary bandits is well established, understanding analogous limits in time-varying settings is challenging. Existing lower bounds…

The Energy Cost of Reasoning: Analyzing Energy Usage in LLMs with Test-time Compute

arXiv:2505.14733v2 Announce Type: replace Abstract: Scaling large language models (LLMs) has driven significant advancements, yet it faces diminishing returns and escalating energy demands. This work explores how test-time compute (TTC) can serve as an energy-efficient complement to conventional scaling strategies…

Unveiling the Training Dynamics of ReLU Networks through a Linear Lens

arXiv:2511.05628v1 Announce Type: new Abstract: Deep neural networks, particularly those employing Rectified Linear Units (ReLU), are often perceived as complex, high-dimensional, non-linear systems. This complexity poses a significant challenge to understanding their internal learning mechanisms. In this work, we propose…

Learning Stochastic Multiscale Models

arXiv:2506.22655v2 Announce Type: replace Abstract: The physical sciences are replete with dynamical systems that require the resolution of a wide range of length and time scales. This presents significant computational challenges since direct numerical simulation requires discretization at the finest…