Archives AI News

LLM Meeting Decision Trees on Tabular Data

arXiv:2505.17918v2 Announce Type: replace Abstract: Tabular data have been playing a vital role in diverse real-world fields, including healthcare, finance, etc. With the recent success of Large Language Models (LLMs), early explorations of extending LLMs to the domain of tabular…

Natural Geometry of Robust Data Attribution: From Convex Models to Deep Networks

arXiv:2512.09103v1 Announce Type: new Abstract: Data attribution methods identify which training examples are responsible for a model’s predictions, but their sensitivity to distributional perturbations undermines practical reliability. We present a unified framework for certified robust attribution that extends from convex…

MAESTRO: Multi-Agent Environment Shaping through Task and Reward Optimization

arXiv:2511.19253v2 Announce Type: replace Abstract: Cooperative Multi-Agent Reinforcement Learning (MARL) faces two major design bottlenecks: crafting dense reward functions and constructing curricula that avoid local optima in high-dimensional, non-stationary environments. Existing approaches rely on fixed heuristics or use Large Language…

Learning Unmasking Policies for Diffusion Language Models

arXiv:2512.09106v1 Announce Type: new Abstract: Diffusion (Large) Language Models (dLLMs) now match the downstream performance of their autoregressive counterparts on many tasks, while holding the promise of being more efficient during inference. One particularly successful variant is masked discrete diffusion,…

Neural Diversity Regularizes Hallucinations in Language Models

arXiv:2510.20690v2 Announce Type: replace-cross Abstract: Language models continue to hallucinate despite increases in parameters, compute, and data. We propose neural diversity — decorrelated parallel representations — as a principled mechanism that reduces hallucination rates at fixed parameter and data budgets.…