Archives AI News

RLLaVA: An RL-central Framework for Language and Vision Assistants

arXiv:2512.21450v1 Announce Type: new Abstract: We present an RL-central framework for Language and Vision Assistants (RLLaVA) with its formulation of Markov decision process (MDP). RLLaVA decouples RL algorithmic logic from model architecture and distributed execution, supporting researchers in implementing new…

dUltra: Ultra-Fast Diffusion Language Models via Reinforcement Learning

arXiv:2512.21446v1 Announce Type: new Abstract: Masked diffusion language models (MDLMs) offer the potential for parallel token generation, but most open-source MDLMs decode fewer than 5 tokens per model forward pass even with sophisticated sampling strategies. As a result, their sampling…

An Equivariance Toolbox for Learning Dynamics

arXiv:2512.21447v1 Announce Type: new Abstract: Many theoretical results in deep learning can be traced to symmetry or equivariance of neural networks under parameter transformations. However, existing analyses are typically problem-specific and focus on first-order consequences such as conservation laws, while…

DeepCQ: General-Purpose Deep-Surrogate Framework for Lossy Compression Quality Prediction

arXiv:2512.21433v1 Announce Type: new Abstract: Error-bounded lossy compression techniques have become vital for scientific data management and analytics, given the ever-increasing volume of data generated by modern scientific simulations and instruments. Nevertheless, assessing data quality post-compression remains computationally expensive due…

kooplearn: A Scikit-Learn Compatible Library of Algorithms for Evolution Operator Learning

arXiv:2512.21409v1 Announce Type: new Abstract: kooplearn is a machine-learning library that implements linear, kernel, and deep-learning estimators of dynamical operators and their spectral decompositions. kooplearn can model both discrete-time evolution operators (Koopman/Transfer) and continuous-time infinitesimal generators. By learning these operators,…