Archives AI News

On the optimization dynamics of RLVR: Gradient gap and step size thresholds

arXiv:2510.08539v1 Announce Type: cross Abstract: Reinforcement Learning with Verifiable Rewards (RLVR), which uses simple binary feedback to post-train large language models, has shown significant empirical success. However, a principled understanding of why it works has been lacking. This paper builds…

October 10, 2025

Reconstructing the local density field with combined convolutional and point cloud architecture

arXiv:2510.08573v1 Announce Type: cross Abstract: We construct a neural network to perform regression on the local dark-matter density field given line-of-sight peculiar velocities of dark-matter halos, biased tracers of the dark matter field. Our architecture combines a convolutional U-Net with…

October 10, 2025

Beyond independent component analysis: identifiability and algorithms

arXiv:2510.07525v1 Announce Type: cross Abstract: Independent Component Analysis (ICA) is a classical method for recovering latent variables with useful identifiability properties. For independent variables, cumulant tensors are diagonal; relaxing independence yields tensors whose zero structure generalizes diagonality. These models have…

October 10, 2025

Distribution Transformers: Fast Approximate Bayesian Inference With On-The-Fly Prior Adaptation

arXiv:2502.02463v2 Announce Type: replace Abstract: While Bayesian inference provides a principled framework for reasoning under uncertainty, its widespread adoption is limited by the intractability of exact posterior computation, necessitating the use of approximate inference. However, existing methods are often computationally…

October 10, 2025

Transferable Generative Models Bridge Femtosecond to Nanosecond Time-Step Molecular Dynamics

arXiv:2510.07589v1 Announce Type: cross Abstract: Understanding molecular structure, dynamics, and reactivity requires bridging processes that occur across widely separated time scales. Conventional molecular dynamics simulations provide atomistic resolution, but their femtosecond time steps limit access to the slow conformational changes…

October 10, 2025

Rotated Mean-Field Variational Inference and Iterative Gaussianization

arXiv:2510.07732v1 Announce Type: cross Abstract: We propose to perform mean-field variational inference (MFVI) in a rotated coordinate system that reduces correlations between variables. The rotation is determined by principal component analysis (PCA) of a cross-covariance matrix involving the target’s score…

October 10, 2025

PO-Flow: Flow-based Generative Models for Sampling Potential Outcomes and Counterfactuals

arXiv:2505.16051v2 Announce Type: replace Abstract: Predicting potential and counterfactual outcomes from observational data is central to clinical decision-making, where physicians must weigh treatments for an individual patient rather than relying solely on average effects at the population level. We propose…

October 10, 2025

Multi-level informed optimization via decomposed Kriging for large design problems under uncertainty

arXiv:2510.07904v1 Announce Type: cross Abstract: Engineering design involves demanding models encompassing many decision variables and uncontrollable parameters. In addition, unavoidable aleatoric and epistemic uncertainties can be very impactful and add further complexity. The state-of-the-art adopts two steps, uncertainty quantification and…

October 10, 2025

Some theoretical improvements on the tightness of PAC-Bayes risk certificates for neural networks

arXiv:2510.07935v1 Announce Type: cross Abstract: This paper presents four theoretical contributions that improve the usability of risk certificates for neural networks based on PAC-Bayes bounds. First, two bounds on the KL divergence between Bernoulli distributions enable the derivation of the…

October 10, 2025

Disparate Conditional Prediction in Multiclass Classifiers

arXiv:2206.03234v4 Announce Type: replace-cross Abstract: We propose methods for auditing multiclass classifiers for fairness under multiclass equalized odds,by estimating the deviation from equalized odds when the classifier is not completely fair. We generalize to multiclass classifiers the measure of Disparate…

October 10, 2025