Archives AI News

Learning by Steering the Neural Dynamics: A Statistical Mechanics Perspective

arXiv:2510.11984v1 Announce Type: new Abstract: Despite the striking successes of deep neural networks trained with gradient-based optimization, these methods differ fundamentally from their biological counterparts. This gap raises key questions about how nature achieves robust, sample-efficient learning at minimal energy…

MergeBench: A Benchmark for Merging Domain-Specialized LLMs

arXiv:2505.10833v4 Announce Type: replace Abstract: Model merging provides a scalable alternative to multi-task training by combining specialized finetuned models through parameter arithmetic, enabling efficient deployment without the need for joint training or access to all task data. While recent methods…

Mamaba Can Learn Low-Dimensional Targets In-Context via Test-Time Feature Learning

arXiv:2510.12026v1 Announce Type: new Abstract: Mamba, a recently proposed linear-time sequence model, has attracted significant attention for its computational efficiency and strong empirical performance. However, a rigorous theoretical understanding of its underlying mechanisms remains limited. In this work, we provide…

Your VAR Model is Secretly an Efficient and Explainable Generative Classifier

arXiv:2510.12060v1 Announce Type: new Abstract: Generative classifiers, which leverage conditional generative models for classification, have recently demonstrated desirable properties such as robustness to distribution shifts. However, recent progress in this area has been largely driven by diffusion-based models, whose substantial…

Offline Fictitious Self-Play for Competitive Games

arXiv:2403.00841v2 Announce Type: replace-cross Abstract: Offline Reinforcement Learning (RL) enables policy improvement from fixed datasets without online interactions, making it highly suitable for real-world applications lacking efficient simulators. Despite its success in the single-agent setting, offline multi-agent RL remains a…