Archives AI News

A Sustainable AI Economy Needs Data Deals That Work for Generators

arXiv:2601.09966v1 Announce Type: new Abstract: We argue that the machine learning value chain is structurally unsustainable due to an economic data processing inequality: each state in the data cycle from inputs to model weights to synthetic outputs refines technical signal…

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

arXiv:2601.08763v2 Announce Type: replace Abstract: Reinforcement learning (RL) has become a central paradigm for post-training large language models (LLMs), particularly for complex reasoning tasks, yet it often suffers from exploration collapse: policies prematurely concentrate on a small set of dominant…

GreedyPixel: Fine-Grained Black-Box Adversarial Attack Via Greedy Algorithm

arXiv:2501.14230v4 Announce Type: replace-cross Abstract: Deep neural networks are highly vulnerable to adversarial examples, which are inputs with small, carefully crafted perturbations that cause misclassification — making adversarial attacks a critical tool for evaluating robustness. Existing black-box methods typically entail…

In-Context Operator Learning on the Space of Probability Measures

arXiv:2601.09979v1 Announce Type: new Abstract: We introduce emph{in-context operator learning on probability measure spaces} for optimal transport (OT). The goal is to learn a single solution operator that maps a pair of distributions to the OT map, using only few-shot…

OBLR-PO: A Theoretical Framework for Stable Reinforcement Learning

arXiv:2511.23310v2 Announce Type: replace-cross Abstract: Existing reinforcement learning (RL)-based post-training methods for large language models have advanced rapidly, yet their design has largely been guided by heuristics rather than systematic theoretical principles. This gap limits our understanding of the properties…

Continuous-Depth Transformers with Learned Control Dynamics

arXiv:2601.10007v1 Announce Type: new Abstract: We present a hybrid transformer architecture that replaces discrete middle layers with a continuous-depth Neural Ordinary Differential Equation (ODE) block, enabling inference-time control over generation attributes via a learned steering signal. Unlike standard transformers that…