Archives AI News

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-Thinking Reasoning

arXiv:2507.16814v2 Announce Type: replace Abstract: Enhancing large vision-language models (LVLMs) with visual slow-thinking reasoning is crucial for solving complex multimodal tasks. However, since LVLMs are mainly trained with vision-language alignment, it is difficult to adopt on-policy reinforcement learning (RL) to…

Weight Decay may matter more than muP for Learning Rate Transfer in Practice

arXiv:2510.19093v1 Announce Type: new Abstract: Transferring the optimal learning rate from small to large neural networks can enable efficient training at scales where hyperparameter tuning is otherwise prohibitively expensive. To this end, the Maximal Update Parameterization (muP) proposes a learning…

Fast MRI for All: Bridging Access Gaps by Training without Raw Data

arXiv:2411.13022v3 Announce Type: replace-cross Abstract: Physics-driven deep learning (PD-DL) approaches have become popular for improved reconstruction of fast magnetic resonance imaging (MRI) scans. Though PD-DL offers higher acceleration rates than existing clinical fast MRI techniques, their use has been limited…

MetaCluster: Enabling Deep Compression of Kolmogorov-Arnold Network

arXiv:2510.19105v1 Announce Type: new Abstract: Kolmogorov-Arnold Networks (KANs) replace scalar weights with per-edge vectors of basis coefficients, thereby boosting expressivity and accuracy but at the same time resulting in a multiplicative increase in parameters and memory. We propose MetaCluster, a…

Learning Peer Influence Probabilities with Linear Contextual Bandits

arXiv:2510.19119v1 Announce Type: new Abstract: In networked environments, users frequently share recommendations about content, products, services, and courses of action with others. The extent to which such recommendations are successful and adopted is highly contextual, dependent on the characteristics of…

Democratizing AI scientists using ToolUniverse

arXiv:2509.23426v2 Announce Type: replace-cross Abstract: AI scientists are emerging computational systems that serve as collaborative partners in discovery. These systems remain difficult to build because they are bespoke, tied to rigid workflows, and lack shared environments that unify tools, data,…