Archives AI News

Scaling Laws and Symmetry, Evidence from Neural Force Fields

arXiv:2510.09768v2 Announce Type: replace Abstract: We present an empirical study in the geometric task of learning interatomic potentials, which shows equivariance matters even more at larger scales; we show a clear power-law scaling behaviour with respect to data, parameters and…

S2O: Early Stopping for Sparse Attention via Online Permutation

arXiv:2602.22575v2 Announce Type: replace Abstract: Attention scales quadratically with sequence length, fundamentally limiting long-context inference. Existing block-granularity sparsification can reduce latency, but coarse blocks impose an intrinsic sparsity ceiling, making further improvements difficult even with carefully engineered designs. We present…

Learning in the Fisher Subspace: A Guided Initialization for LoRA Fine-Tuning

arXiv:2605.01046v2 Announce Type: replace Abstract: LoRA adapts large language models (LLMs) by restricting updates to low-rank subspaces of pre-trained weights. While this substantially reduces training cost, the effectiveness of adaptation critically depends on which subspace is chosen at initialization: a…

Using Echo-State Networks to Reproduce Rare Events in Chaotic Systems

arXiv:2505.16208v2 Announce Type: replace-cross Abstract: We apply Echo-State Networks to predict time series and statistical properties of the competitive Lotka-Volterra model in the chaotic regime. In particular, we demonstrate that Echo-State Networks successfully learn the chaotic attractor of the competitive…

Healthcare AI GYM for Medical Agents

arXiv:2605.02943v1 Announce Type: new Abstract: Clinical reasoning demands multi-step interactions — gathering patient history, ordering tests, interpreting results, and making safe treatment decisions — yet a unified training environment provides the breadth of clinical domains and specialized tools to train…