Archives AI News

CapTrack: Multifaceted Evaluation of Forgetting in LLM Post-Training

arXiv:2603.06610v1 Announce Type: new Abstract: Large language model (LLM) post-training enhances latent skills, unlocks value alignment, improves performance, and enables domain adaptation. Unfortunately, post-training is known to induce forgetting, especially in the ubiquitous use-case of leveraging third-party pre-trained models, which…

March 10, 2026

Discovering Semantic Latent Structures in Psychological Scales: A Response-Free Pathway to Efficient Simplification

arXiv:2602.12575v2 Announce Type: replace-cross Abstract: Psychological scale refinement traditionally relies on response-based methods such as factor analysis, item response theory, and network psychometrics to optimize item composition. Although rigorous, these approaches require large samples and may be constrained by data…

March 10, 2026

Consensus is Not Verification: Why Crowd Wisdom Strategies Fail for LLM Truthfulness

arXiv:2603.06612v1 Announce Type: new Abstract: Pass@k and other methods of scaling inference compute can improve language model performance in domains with external verifiers, including mathematics and code, where incorrect candidates can be filtered reliably. This raises a natural question: can…

March 10, 2026

Robust Transfer Learning with Side Information

arXiv:2603.07921v1 Announce Type: cross Abstract: Robust Markov Decision Processes (MDPs) address environmental shift through distributionally robust optimization (DRO) by finding an optimal worst-case policy within an uncertainty set of transition kernels. However, standard DRO approaches require enlarging the uncertainty set…

March 10, 2026

OptiRoulette Optimizer: A New Stochastic Meta-Optimizer for up to 5.3x Faster Convergence

arXiv:2603.06613v1 Announce Type: new Abstract: This paper presents OptiRoulette, a stochastic meta-optimizer that selects update rules during training instead of fixing a single optimizer. The method combines warmup optimizer locking, random sampling from an active optimizer pool, compatibility-aware learning-rate scaling…

March 10, 2026

Integrating Lagrangian Neural Networks into the Dyna Framework for Reinforcement Learning

arXiv:2603.08468v1 Announce Type: cross Abstract: Model-based reinforcement learning (MBRL) is sample-efficient but depends on the accuracy of the learned dynamics, which are often modeled using black-box methods that do not adhere to physical laws. Those methods tend to produce inaccurate…

March 10, 2026

Correlation Analysis of Generative Models

arXiv:2603.06614v1 Announce Type: new Abstract: Based on literature review about existing diffusion models and flow matching with a neural network to predict a predefined target from noisy data, a unified representation is first proposed for these models using two simple…

March 10, 2026

Puppet-CNN: Continuous Parameter Dynamics for Input-Adaptive Convolutional Networks

arXiv:2411.12876v2 Announce Type: replace Abstract: Modern convolutional neural networks (CNNs) organize computation as a discrete stack of layers whose parameters are independently stored and learned, with the number of layers fixed as an architectural hyperparameter. In this work, we explore…

March 10, 2026

Annealed Co-Generation: Disentangling Variables via Progressive Pairwise Modeling

arXiv:2603.06615v1 Announce Type: new Abstract: For multivariate co-generation in scientific applications, we advocate pairwise block rather than joint modeling of all variables. This design mitigates the computational burden and data imbalance. To this end, we propose an Annealed Co-Generation (ACG)…

March 10, 2026

RACER: Risk-Aware Calibrated Efficient Routing for Large Language Models

arXiv:2603.06616v1 Announce Type: new Abstract: Efficiently routing queries to the optimal large language model (LLM) is crucial for optimizing the cost-performance trade-off in multi-model systems. However, most existing routers rely on single-model selection, making them susceptible to misrouting. In this…

March 10, 2026