Archives AI News

Deep Learning for Continuous-Time Stochastic Control with Jumps

arXiv:2505.15602v3 Announce Type: replace Abstract: In this paper, we introduce a model-based deep-learning approach to solve finite-horizon continuous-time stochastic control problems with jumps. We iteratively train two neural networks: one to represent the optimal policy and the other to approximate…

A Sustainable AI Economy Needs Data Deals That Work for Generators

arXiv:2601.09966v1 Announce Type: new Abstract: We argue that the machine learning value chain is structurally unsustainable due to an economic data processing inequality: each state in the data cycle from inputs to model weights to synthetic outputs refines technical signal…

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

arXiv:2601.08763v2 Announce Type: replace Abstract: Reinforcement learning (RL) has become a central paradigm for post-training large language models (LLMs), particularly for complex reasoning tasks, yet it often suffers from exploration collapse: policies prematurely concentrate on a small set of dominant…

GreedyPixel: Fine-Grained Black-Box Adversarial Attack Via Greedy Algorithm

arXiv:2501.14230v4 Announce Type: replace-cross Abstract: Deep neural networks are highly vulnerable to adversarial examples, which are inputs with small, carefully crafted perturbations that cause misclassification — making adversarial attacks a critical tool for evaluating robustness. Existing black-box methods typically entail…

In-Context Operator Learning on the Space of Probability Measures

arXiv:2601.09979v1 Announce Type: new Abstract: We introduce emph{in-context operator learning on probability measure spaces} for optimal transport (OT). The goal is to learn a single solution operator that maps a pair of distributions to the OT map, using only few-shot…