Archives AI News

A Simple Method for PMF Estimation on Large Supports

arXiv:2510.15132v1 Announce Type: new Abstract: We study nonparametric estimation of a probability mass function (PMF) on a large discrete support, where the PMF is multi-modal and heavy-tailed. The core idea is to treat the empirical PMF as a signal on…

Learning to Interpret Weight Differences in Language Models

arXiv:2510.05092v2 Announce Type: replace Abstract: Finetuning (pretrained) language models is a standard approach for updating their internal parametric knowledge and specializing them to new tasks and domains. However, the corresponding model weight changes (“weight diffs”) are not generally interpretable. While…

A simple mean field model of feature learning

arXiv:2510.15174v1 Announce Type: new Abstract: Feature learning (FL), where neural networks adapt their internal representations during training, remains poorly understood. Using methods from statistical physics, we derive a tractable, self-consistent mean-field (MF) theory for the Bayesian posterior of two-layer non-linear…

Bayesian Ego-graph inference for Networked Multi-Agent Reinforcement Learning

arXiv:2509.16606v2 Announce Type: replace-cross Abstract: In networked multi-agent reinforcement learning (Networked-MARL), decentralized agents must act under local observability and constrained communication over fixed physical graphs. Existing methods often assume static neighborhoods, limiting adaptability to dynamic or heterogeneous environments. While centralized…

Finding geodesics with the Deep Ritz method

arXiv:2510.15177v1 Announce Type: new Abstract: Geodesic problems involve computing trajectories between prescribed initial and final states to minimize a user-defined measure of distance, cost, or energy. They arise throughout physics and engineering — for instance, in determining optimal paths through…

Stochastic Optimization with Random Search

arXiv:2510.15610v1 Announce Type: cross Abstract: We revisit random search for stochastic optimization, where only noisy function evaluations are available. We show that the method works under weaker smoothness assumptions than previously considered, and that stronger assumptions enable improved guarantees. In…