Archives AI News

On the optimization dynamics of RLVR: Gradient gap and step size thresholds

arXiv:2510.08539v1 Announce Type: cross Abstract: Reinforcement Learning with Verifiable Rewards (RLVR), which uses simple binary feedback to post-train large language models, has shown significant empirical success. However, a principled understanding of why it works has been lacking. This paper builds…

Beyond independent component analysis: identifiability and algorithms

arXiv:2510.07525v1 Announce Type: cross Abstract: Independent Component Analysis (ICA) is a classical method for recovering latent variables with useful identifiability properties. For independent variables, cumulant tensors are diagonal; relaxing independence yields tensors whose zero structure generalizes diagonality. These models have…

Rotated Mean-Field Variational Inference and Iterative Gaussianization

arXiv:2510.07732v1 Announce Type: cross Abstract: We propose to perform mean-field variational inference (MFVI) in a rotated coordinate system that reduces correlations between variables. The rotation is determined by principal component analysis (PCA) of a cross-covariance matrix involving the target’s score…

Disparate Conditional Prediction in Multiclass Classifiers

arXiv:2206.03234v4 Announce Type: replace-cross Abstract: We propose methods for auditing multiclass classifiers for fairness under multiclass equalized odds,by estimating the deviation from equalized odds when the classifier is not completely fair. We generalize to multiclass classifiers the measure of Disparate…