Archives AI News

TimeSAE: Sparse Decoding for Faithful Explanations of Black-Box Time Series Models

arXiv:2601.09776v1 Announce Type: new Abstract: As black box models and pretrained models gain traction in time series applications, understanding and explaining their predictions becomes increasingly vital, especially in high-stakes domains where interpretability and trust are essential. However, most of the…

MixMin: Finding Data Mixtures via Convex Minimization

arXiv:2502.10510v3 Announce Type: replace Abstract: Modern machine learning pipelines are increasingly combining and mixing data from diverse and disparate sources, e.g., pre-training large language models. Yet, finding the optimal data mixture is a challenging and open problem. We formalize this…

Deep Learning for Continuous-Time Stochastic Control with Jumps

arXiv:2505.15602v3 Announce Type: replace Abstract: In this paper, we introduce a model-based deep-learning approach to solve finite-horizon continuous-time stochastic control problems with jumps. We iteratively train two neural networks: one to represent the optimal policy and the other to approximate…

A Sustainable AI Economy Needs Data Deals That Work for Generators

arXiv:2601.09966v1 Announce Type: new Abstract: We argue that the machine learning value chain is structurally unsustainable due to an economic data processing inequality: each state in the data cycle from inputs to model weights to synthetic outputs refines technical signal…

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

arXiv:2601.08763v2 Announce Type: replace Abstract: Reinforcement learning (RL) has become a central paradigm for post-training large language models (LLMs), particularly for complex reasoning tasks, yet it often suffers from exploration collapse: policies prematurely concentrate on a small set of dominant…

GreedyPixel: Fine-Grained Black-Box Adversarial Attack Via Greedy Algorithm

arXiv:2501.14230v4 Announce Type: replace-cross Abstract: Deep neural networks are highly vulnerable to adversarial examples, which are inputs with small, carefully crafted perturbations that cause misclassification — making adversarial attacks a critical tool for evaluating robustness. Existing black-box methods typically entail…