Archives AI News

Robustness in the Face of Partial Identifiability in Reward Learning

arXiv:2501.06376v2 Announce Type: replace-cross Abstract: In Reward Learning (ReL), we are given feedback on an unknown target reward, and the goal is to use this information to recover it in order to carry out some downstream application, e.g., planning. When…

BKP: An R Package for Beta Kernel Process Modeling

arXiv:2508.10447v2 Announce Type: replace-cross Abstract: We present BKP, a user-friendly and extensible R package that implements the Beta Kernel Process (BKP) — a fully nonparametric and computationally efficient framework for modeling spatially varying binomial probabilities. The BKP model combines localized…

Long-time dynamics and universality of nonconvex gradient descent

arXiv:2509.11426v1 Announce Type: cross Abstract: This paper develops a general approach to characterize the long-time trajectory behavior of nonconvex gradient descent in generalized single-index models in the large aspect ratio regime. In this regime, we show that for each iteration…