Archives AI News

Weight Decay may matter more than muP for Learning Rate Transfer in Practice

arXiv:2510.19093v1. Announce Type: new. Abstract: Transferring the optimal learning rate from small to large neural networks can enable efficient training at scales where hyperparameter tuning is otherwise prohibitively expensive. To this end, the Maximal Update Parameterization (muP) proposes a learning…

October 23, 2025

Rebalancing with Calibrated Sub-classes (RCS): A Statistical Fusion-based Framework for Robust Imbalanced Classification across Modalities

arXiv:2510.13656v2. Announce Type: replace. Abstract: Class imbalance, where certain classes have insufficient data, poses a critical challenge for robust classification, often biasing models toward majority classes. Distribution calibration offers a promising avenue to address this by estimating more accurate class…

October 23, 2025

What Makes a Good Curriculum? Disentangling the Effects of Data Ordering on LLM Mathematical Reasoning

arXiv:2510.19099v1. Announce Type: new. Abstract: Curriculum learning (CL) – ordering training data from easy to hard – has become a popular strategy for improving reasoning in large language models (LLMs). Yet prior work employs disparate difficulty metrics and training setups,…

October 23, 2025

Fast MRI for All: Bridging Access Gaps by Training without Raw Data

arXiv:2411.13022v3. Announce Type: replace-cross. Abstract: Physics-driven deep learning (PD-DL) approaches have become popular for improved reconstruction of fast magnetic resonance imaging (MRI) scans. Though PD-DL offers higher acceleration rates than existing clinical fast MRI techniques, their use has been limited…

October 23, 2025

MetaCluster: Enabling Deep Compression of Kolmogorov-Arnold Network

arXiv:2510.19105v1. Announce Type: new. Abstract: Kolmogorov-Arnold Networks (KANs) replace scalar weights with per-edge vectors of basis coefficients, thereby boosting expressivity and accuracy but at the same time resulting in a multiplicative increase in parameters and memory. We propose MetaCluster, a…

October 23, 2025

Do LLMs Really Forget? Evaluating Unlearning with Knowledge Correlation and Confidence Awareness

arXiv:2506.05735v4. Announce Type: replace-cross. Abstract: Machine unlearning techniques aim to mitigate unintended memorization in large language models (LLMs). However, existing approaches predominantly focus on the explicit removal of isolated facts, often overlooking latent inferential dependencies and the non-deterministic nature of…

October 23, 2025

Learning Peer Influence Probabilities with Linear Contextual Bandits

arXiv:2510.19119v1. Announce Type: new. Abstract: In networked environments, users frequently share recommendations about content, products, services, and courses of action with others. The extent to which such recommendations are successful and adopted is highly contextual, dependent on the characteristics of…

October 23, 2025

Democratizing AI scientists using ToolUniverse

arXiv:2509.23426v2. Announce Type: replace-cross. Abstract: AI scientists are emerging computational systems that serve as collaborative partners in discovery. These systems remain difficult to build because they are bespoke, tied to rigid workflows, and lack shared environments that unify tools, data,…

October 23, 2025

Steering Autoregressive Music Generation with Recursive Feature Machines

arXiv:2510.19127v1. Announce Type: new. Abstract: Controllable music generation remains a significant challenge, with existing methods often requiring model retraining or introducing audible artifacts. We introduce MusicRFM, a framework that adapts Recursive Feature Machines (RFMs) to enable fine-grained, interpretable control over…

October 23, 2025

Large Connectome Model: An fMRI Foundation Model of Brain Connectomes Empowered by Brain-Environment Interaction in Multitask Learning Landscape

arXiv:2510.18910v1. Announce Type: new. Abstract: A reliable foundation model of functional neuroimages is critical to promote clinical applications where the performance of current AI models is significantly impeded by a limited sample size. To that end, tremendous efforts have been…

October 23, 2025