Archives AI News

Simultaneous Blackwell Approachability and Applications to Multiclass Omniprediction

arXiv:2602.17577v1 Announce Type: cross Abstract: Omniprediction is a learning problem that requires suboptimality bounds for each of a family of losses $mathcal{L}$ against a family of comparator predictors $mathcal{C}$. We initiate the study of omniprediction in a multiclass setting, where…

February 20, 2026

Training Large Reasoning Models Efficiently via Progressive Thought Encoding

arXiv:2602.16839v1 Announce Type: new Abstract: Large reasoning models (LRMs) excel on complex problems but face a critical barrier to efficiency: reinforcement learning (RL) training requires long rollouts for outcome-based rewards, where autoregressive decoding dominates time and memory usage. While sliding-window…

February 20, 2026

Graph Machine Learning based Doubly Robust Estimator for Network Causal Effects

arXiv:2403.11332v3 Announce Type: replace Abstract: We address the challenge of inferring causal effects in social network data. This results in challenges due to interference — where a unit’s outcome is affected by neighbors’ treatments — and network-induced confounding factors. While…

February 20, 2026

What is the Value of Censored Data? An Exact Analysis for the Data-driven Newsvendor

arXiv:2602.16842v1 Announce Type: new Abstract: We study the offline data-driven newsvendor problem with censored demand data. In contrast to prior works where demand is fully observed, we consider the setting where demand is censored at the inventory level and only…

February 20, 2026

On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking

arXiv:2602.16849v1 Announce Type: new Abstract: We present a comprehensive analysis of how two-layer neural networks learn features to solve the modular addition task. Our work provides a full mechanistic interpretation of the learned model and a theoretical explanation of its…

February 20, 2026

Generating Directed Graphs with Dual Attention and Asymmetric Encoding

arXiv:2506.16404v3 Announce Type: replace Abstract: Directed graphs naturally model systems with asymmetric, ordered relationships, essential to applications in biology, transportation, social networks, and visual understanding. Generating such graphs enables tasks such as simulation, data augmentation and novel instance discovery; however,…

February 20, 2026

Position: Why a Dynamical Systems Perspective is Needed to Advance Time Series Modeling

arXiv:2602.16864v1 Announce Type: new Abstract: Time series (TS) modeling has come a long way from early statistical, mainly linear, approaches to the current trend in TS foundation models. With a lot of hype and industrial demand in this field, it…

February 20, 2026

Entropy After $langle texttt{/Think} rangle$ for reasoning model early exiting

arXiv:2509.26522v2 Announce Type: replace Abstract: Reasoning LLMs show improved performance with longer chains of thought. However, recent work has highlighted their tendency to overthink, continuing to revise answers even after reaching the correct solution. We quantitatively confirm this inefficiency from…

February 20, 2026

ML-driven detection and reduction of ballast information in multi-modal datasets

arXiv:2602.16876v1 Announce Type: new Abstract: Modern datasets often contain ballast as redundant or low-utility information that increases dimensionality, storage requirements, and computational cost without contributing meaningful analytical value. This study introduces a generalized, multimodal framework for ballast detection and reduction…

February 20, 2026

Chip-processing method could assist cryptography schemes to keep data secure

By enabling two chips to authenticate each other using a shared fingerprint, this technique can improve privacy and energy efficiency.

February 20, 2026