Archives AI News

Federated Learning in Offline and Online EMG Decoding: A Privacy and Performance Perspective

arXiv:2507.12652v2 Announce Type: replace Abstract: Neural interfaces offer a pathway to intuitive, high-bandwidth interaction, but the sensitive nature of neural data creates significant privacy hurdles for large-scale model training. Federated learning (FL) has emerged as a promising privacy-preserving solution, yet…

February 26, 2026

Benchmarking State Space Models, Transformers, and Recurrent Networks for US Grid Forecasting

arXiv:2602.21415v1 Announce Type: new Abstract: Selecting the right deep learning model for power grid forecasting is challenging, as performance heavily depends on the data available to the operator. This paper presents a comprehensive benchmark of five modern neural architectures: two…

February 26, 2026

Characterization and Learning of Causal Graphs with Latent Confounders and Post-treatment Selection from Interventional Data

arXiv:2509.25800v2 Announce Type: replace Abstract: Interventional causal discovery seeks to identify causal relations by leveraging distributional changes introduced by interventions, even in the presence of latent confounders. Beyond the spurious dependencies induced by latent confounders, we highlight a common yet…

February 26, 2026

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

arXiv:2602.21420v1 Announce Type: new Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has become the leading paradigm for enhancing reasoning in Large Language Models (LLMs). However, standard RLVR algorithms suffer from a well-documented pathology: while they improve Pass@1 accuracy through sharpened…

February 26, 2026

SPACeR: Self-Play Anchoring with Centralized Reference Models

arXiv:2510.18060v2 Announce Type: replace Abstract: Developing autonomous vehicles (AVs) requires not only safety and efficiency, but also realistic, human-like behaviors that are socially aware and predictable. Achieving this requires sim agent policies that are human-like, fast, and scalable in multi-agent…

February 26, 2026

On the Structural Non-Preservation of Epistemic Behaviour under Policy Transformation

arXiv:2602.21424v1 Announce Type: new Abstract: Reinforcement learning (RL) agents under partial observability often condition actions on internally accumulated information such as memory or inferred latent context. We formalise such information-conditioned interaction patterns as behavioural dependency: variation in action selection with…

February 26, 2026

New method could increase LLM training efficiency

By leveraging idle computing time, researchers can double the speed of model training while preserving accuracy.

February 26, 2026

Tackling industry’s burdensome bubble problem

MIT researchers uncovered the physics behind bubble-removing membranes that could improve bioreactors, chemical production, and more.

February 26, 2026

Spurious Rewards: Rethinking Training Signals in RLVR

arXiv:2506.10947v2 Announce Type: replace-cross Abstract: We show that reinforcement learning with verifiable rewards (RLVR) can elicit strong mathematical reasoning in certain language models even with spurious rewards that have little, no, or even negative correlation with the correct answer. For…

February 26, 2026

Optimizer choice matters for the emergence of Neural Collapse

arXiv:2602.16642v3 Announce Type: replace Abstract: Neural Collapse (NC) refers to the emergence of highly symmetric geometric structures in the representations of deep neural networks during the terminal phase of training. Despite its prevalence, the theoretical understanding of NC remains limited.…

February 26, 2026