Archives AI News

Orthogonalized Policy Optimization:Policy Optimization as Orthogonal Projection in Hilbert Space

arXiv:2601.12415v5 Announce Type: replace Abstract: We propose Orthogonalized Policy Optimization (OPO), a principled framework for large language model alignment derived from optimization in the Hilbert function space L2(pi_k). Lifting policy updates from the probability simplex into L2(pi_k) transforms the nonlinear…

February 26, 2026

New method could increase LLM training efficiency

By leveraging idle computing time, researchers can double the speed of model training while preserving accuracy.

February 26, 2026

Tackling industry’s burdensome bubble problem

MIT researchers uncovered the physics behind bubble-removing membranes that could improve bioreactors, chemical production, and more.

February 26, 2026

Spurious Rewards: Rethinking Training Signals in RLVR

arXiv:2506.10947v2 Announce Type: replace-cross Abstract: We show that reinforcement learning with verifiable rewards (RLVR) can elicit strong mathematical reasoning in certain language models even with spurious rewards that have little, no, or even negative correlation with the correct answer. For…

February 26, 2026

Optimizer choice matters for the emergence of Neural Collapse

arXiv:2602.16642v3 Announce Type: replace Abstract: Neural Collapse (NC) refers to the emergence of highly symmetric geometric structures in the representations of deep neural networks during the terminal phase of training. Despite its prevalence, the theoretical understanding of NC remains limited.…

February 26, 2026

Overparameterized Multiple Linear Regression as Hyper-Curve Fitting

arXiv:2404.07849v2 Announce Type: replace-cross Abstract: This work demonstrates that applying a fixed-effect multiple linear regression (MLR) model to an overparameterized dataset is mathematically equivalent to fitting a hyper-curve parameterized by a single scalar. This reformulation shifts the focus from global…

February 26, 2026

Quantum enhanced ensemble GANs for anomaly detection in continuous biomanufacturing

arXiv:2508.21438v2 Announce Type: replace Abstract: The development of continuous biomanufacturing processes requires robust and early anomaly detection, since even minor deviations can compromise yield and stability, leading to disruptions in scheduling, reduced weekly production, and diminished economic performance. These processes…

February 26, 2026

NTK-Guided Implicit Neural Teaching

arXiv:2511.15487v2 Announce Type: replace Abstract: Implicit Neural Representations (INRs) parameterize continuous signals via multilayer perceptrons (MLPs), enabling compact, resolution-independent modeling for tasks like image, audio, and 3D reconstruction. However, fitting high-resolution signals demands optimizing over millions of coordinates, incurring prohibitive…

February 26, 2026

Coarsening Bias from Variable Discretization in Causal Functionals

arXiv:2602.22083v1 Announce Type: cross Abstract: A class of causal effect functionals requires integration over conditional densities of continuous variables, as in mediation effects and nonparametric identification in causal graphical models. Estimating such densities and evaluating the resulting integrals can be…

February 26, 2026

Active operator learning with predictive uncertainty quantification for partial differential equations

arXiv:2503.03178v4 Announce Type: replace Abstract: With the increased prevalence of neural operators being used to provide rapid solutions to partial differential equations (PDEs), understanding the accuracy of model predictions and the associated error levels is necessary for deploying reliable surrogate…

February 26, 2026