Archives AI News

Why Do We Need Warm-up? A Theoretical Perspective

arXiv:2510.03164v1 Announce Type: cross Abstract: Learning rate warm-up – increasing the learning rate at the beginning of training – has become a ubiquitous heuristic in modern deep learning, yet its theoretical foundations remain poorly understood. In this work, we provide…

Best-of-Majority: Minimax-Optimal Strategy for Pass@$k$ Inference Scaling

arXiv:2510.03199v1 Announce Type: cross Abstract: LLM inference often generates a batch of candidates for a prompt and selects one via strategies like majority voting or Best-of- N (BoN). For difficult tasks, this single-shot selection often underperforms. Consequently, evaluations commonly report…

Dynamical local Fr’echet curve regression in manifolds

arXiv:2505.05168v2 Announce Type: replace-cross Abstract: The present paper solves the problem of local linear approximation of the Fr’echet conditional mean in an extrinsic and intrinsic way from time correlated bivariate curve data evaluated in a manifold (see Torres et al,…

A fast non-reversible sampler for Bayesian finite mixture models

arXiv:2510.03226v1 Announce Type: cross Abstract: Finite mixtures are a cornerstone of Bayesian modelling, and it is well-known that sampling from the resulting posterior distribution can be a hard task. In particular, popular reversible Markov chain Monte Carlo schemes are often…

Discrimination in machine learning algorithms

arXiv:2207.00108v2 Announce Type: replace Abstract: Machine learning algorithms are routinely used for business decisions that may directly affect individuals, for example, because a credit scoring algorithm refuses them a loan. It is then relevant from an ethical (and legal) point…