Archives AI News

An Orthogonal Learner for Individualized Outcomes in Markov Decision Processes

arXiv:2509.26429v1 Announce Type: new Abstract: Predicting individualized potential outcomes in sequential decision-making is central for optimizing therapeutic decisions in personalized medicine (e.g., which dosing sequence to give to a cancer patient). However, predicting potential outcomes over long horizons is notoriously…

Understanding and Improving Shampoo and SOAP via Kullback-Leibler Minimization

arXiv:2509.03378v2 Announce Type: replace Abstract: Shampoo and its efficient variant, SOAP, use structured second-moment estimation and have attracted growing interest for their effectiveness in training neural networks (NNs). In practice, Shampoo requires step-size grafting with Adam to achieve competitive performance.…

On Fitting Flow Models with Large Sinkhorn Couplings

arXiv:2506.05526v3 Announce Type: replace-cross Abstract: Flow models transform data gradually from one modality (e.g. noise) onto another (e.g. images). Such models are parameterized by a time-dependent velocity field, trained to fit segments connecting pairs of source and target points. When…

AuON: A Linear-time Alternative to Semi-Orthogonal Momentum Updates

arXiv:2509.24320v2 Announce Type: replace-cross Abstract: Orthogonal gradient updates have emerged as a promising direction in optimization for machine learning. However, traditional approaches such as SVD/QR decomposition incur prohibitive computational costs of O(n^3) and underperform compared to well-tuned SGD with momentum,…