Archives AI News

Hyperparameter Loss Surfaces Are Simple Near their Optima

arXiv:2510.02721v1 Announce Type: cross Abstract: Hyperparameters greatly impact models’ capabilities; however, modern models are too large for extensive search. Instead, researchers design recipes that train well across scales based on their understanding of the hyperparameters. Despite this importance, few tools…

Online Decision-Focused Learning

arXiv:2505.13564v2 Announce Type: replace-cross Abstract: Decision-focused learning (DFL) is an increasingly popular paradigm for training predictive models whose outputs are used in decision-making tasks. Instead of merely optimizing for predictive accuracy, DFL trains models to directly minimize the loss associated…

Differentially Private Wasserstein Barycenters

arXiv:2510.03021v1 Announce Type: cross Abstract: The Wasserstein barycenter is defined as the mean of a set of probability measures under the optimal transport metric, and has numerous applications spanning machine learning, statistics, and computer graphics. In practice these input measures…

Iteratively reweighted kernel machines efficiently learn sparse functions

arXiv:2505.08277v2 Announce Type: replace Abstract: The impressive practical performance of neural networks is often attributed to their ability to learn low-dimensional data representations and hierarchical structure directly from data. In this work, we argue that these two phenomena are not…