Archives AI News

Optimizing Stochastic Gradient Push under Broadcast Communications

arXiv:2604.15549v1 Announce Type: new Abstract: We consider the problem of minimizing the convergence time for decentralized federated learning (DFL) in wireless networks under broadcast communications, with focus on mixing matrix design. The mixing matrix is a critical hyperparameter for DFL…

April 20, 2026

Bridging the phenotype-target gap for molecular generation via multi-objective reinforcement learning

arXiv:2509.21010v2 Announce Type: replace Abstract: The de novo generation of drug-like molecules capable of inducing desirable phenotypic changes is receiving increasing attention. However, previous methods predominantly rely on expression profiles to guide molecule generation, but overlook the perturbative effect of…

April 20, 2026

Natural gradient descent with momentum

arXiv:2604.15554v1 Announce Type: new Abstract: We consider the problem of approximating a function by an element of a nonlinear manifold which admits a differentiable parametrization, typical examples being neural networks with differentiable activation functions or tensor networks. Natural gradient descent…

April 20, 2026

Power to the Clients: Federated Learning in a Dictatorship Setting

arXiv:2510.22149v3 Announce Type: replace Abstract: Federated learning (FL) has emerged as a promising paradigm for decentralized model training, enabling multiple clients to collaboratively learn a shared model without exchanging their local data. However, the decentralized nature of FL also introduces…

April 20, 2026

Learning Affine-Equivariant Proximal Operators

arXiv:2604.15556v1 Announce Type: new Abstract: Proximal operators are fundamental across many applications in signal processing and machine learning, including solving ill-posed inverse problems. Recent work has introduced Learned Proximal Networks (LPNs), providing parametric functions that compute exact proximals for data-driven…

April 20, 2026

Dynamic Tool Dependency Retrieval for Lightweight Function Calling

arXiv:2512.17052v4 Announce Type: replace Abstract: Function calling agents powered by Large Language Models (LLMs) select external tools to automate complex tasks. On-device agents typically use a retrieval module to select relevant tools, improving performance and reducing context length. However, existing…

April 20, 2026

Predicting Where Steering Vectors Succeed

arXiv:2604.15557v1 Announce Type: new Abstract: Steering vectors work for some concepts and layers but fail for others, and practitioners have no way to predict which setting applies before running an intervention. We introduce the Linear Accessibility Profile (LAP), a per-layer…

April 20, 2026

Plateaus, Optima, and Overfitting in Multi-Layer Perceptrons: A Saddle-Saddle-Attractor Scenario

arXiv:2604.02393v2 Announce Type: replace Abstract: Vanishing gradients and overfitting are central problems in machine learning, yet are typically analyzed in asymptotic regimes that obscure their dynamical origins. Here we provide a dynamical description of learning in multi-layer perceptrons (MLPs) via…

April 20, 2026

Reward Weighted Classifier-Free Guidance as Policy Improvement in Autoregressive Models

arXiv:2604.15577v1 Announce Type: new Abstract: Consider an auto-regressive model that produces outputs x (e.g., answers to questions, molecules) each of which can be summarized by an attribute vector y (e.g., helpfulness vs. harmlessness, or bio-availability vs. lipophilicity). An arbitrary reward…

April 20, 2026

SOAR: Self-Correction for Optimal Alignment and Refinement in Diffusion Models

arXiv:2604.12617v2 Announce Type: replace Abstract: The post-training pipeline for diffusion models currently has two stages: supervised fine-tuning (SFT) on curated data and reinforcement learning (RL) with reward models. A fundamental gap separates them. SFT optimizes the denoiser only on ground-truth…

April 20, 2026