Archives AI News

Beyond the Ideal: Analyzing the Inexact Muon Update

arXiv:2510.19933v1 Announce Type: new Abstract: The Muon optimizer has rapidly emerged as a powerful, geometry-aware alternative to AdamW, demonstrating strong performance in large-scale training of neural networks. However, a critical theory-practice disconnect exists: Muon’s efficiency relies on fast, approximate orthogonalization,…

FINDER: Feature Inference on Noisy Datasets using Eigenspace Residuals

arXiv:2510.19917v1 Announce Type: new Abstract: ”Noisy” datasets (regimes with low signal to noise ratios, small sample sizes, faulty data collection, etc) remain a key research frontier for classification methods with both theoretical and practical implications. We introduce FINDER, a rigorous…

FairGRPO: Fair Reinforcement Learning for Equitable Clinical Reasoning

arXiv:2510.19893v1 Announce Type: new Abstract: Medical artificial intelligence systems have achieved remarkable diagnostic capabilities, yet they consistently exhibit performance disparities across demographic groups, causing real-world harm to underrepresented populations. While recent multimodal reasoning foundation models have advanced clinical diagnosis through…

From Large to Small: Transferring CUDA Optimization Expertise via Reasoning Graph

arXiv:2510.19873v1 Announce Type: new Abstract: Despite significant evolution of CUDA programming and domain-specific libraries, effectively utilizing GPUs with massively parallel engines remains difficult. Large language models (LLMs) show strong potential in generating optimized CUDA code from sequential code. However, using…

Are Greedy Task Orderings Better Than Random in Continual Linear Regression?

arXiv:2510.19941v1 Announce Type: new Abstract: We analyze task orderings in continual learning for linear regression, assuming joint realizability of training data. We focus on orderings that greedily maximize dissimilarity between consecutive tasks, a concept briefly explored in prior work but…

Towards Robust Zero-Shot Reinforcement Learning

arXiv:2510.15382v2 Announce Type: replace Abstract: The recent development of zero-shot reinforcement learning (RL) has opened a new avenue for learning pre-trained generalist policies that can adapt to arbitrary new tasks in a zero-shot manner. While the popular Forward-Backward representations (FB)…