Archives AI News

MuLoCo: Muon is a practical inner optimizer for DiLoCo

arXiv:2505.23725v2. Announce Type: replace. Abstract: DiLoCo is a powerful framework for training large language models (LLMs), enabling larger optimal batch sizes and increased accelerator utilization under networking constraints. However, DiLoCo’s performance has been shown to degrade as the number of…

Spurious Rewards: Rethinking Training Signals in RLVR

arXiv:2506.10947v2. Announce Type: replace-cross. Abstract: We show that reinforcement learning with verifiable rewards (RLVR) can elicit strong mathematical reasoning in certain language models even with spurious rewards that have little, no, or even negative correlation with the correct answer. For…

Optimizer choice matters for the emergence of Neural Collapse

arXiv:2602.16642v3. Announce Type: replace. Abstract: Neural Collapse (NC) refers to the emergence of highly symmetric geometric structures in the representations of deep neural networks during the terminal phase of training. Despite its prevalence, the theoretical understanding of NC remains limited.…

Overparameterized Multiple Linear Regression as Hyper-Curve Fitting

arXiv:2404.07849v2. Announce Type: replace-cross. Abstract: This work demonstrates that applying a fixed-effect multiple linear regression (MLR) model to an overparameterized dataset is mathematically equivalent to fitting a hyper-curve parameterized by a single scalar. This reformulation shifts the focus from global…