Archives AI News

LLM Meeting Decision Trees on Tabular Data

arXiv:2505.17918v2 Announce Type: replace Abstract: Tabular data have been playing a vital role in diverse real-world fields, including healthcare, finance, etc. With the recent success of Large Language Models (LLMs), early explorations of extending LLMs to the domain of tabular…

Natural Geometry of Robust Data Attribution: From Convex Models to Deep Networks

arXiv:2512.09103v1 Announce Type: new Abstract: Data attribution methods identify which training examples are responsible for a model’s predictions, but their sensitivity to distributional perturbations undermines practical reliability. We present a unified framework for certified robust attribution that extends from convex…

MAESTRO: Multi-Agent Environment Shaping through Task and Reward Optimization

arXiv:2511.19253v2 Announce Type: replace Abstract: Cooperative Multi-Agent Reinforcement Learning (MARL) faces two major design bottlenecks: crafting dense reward functions and constructing curricula that avoid local optima in high-dimensional, non-stationary environments. Existing approaches rely on fixed heuristics or use Large Language…

Learning Unmasking Policies for Diffusion Language Models

arXiv:2512.09106v1 Announce Type: new Abstract: Diffusion (Large) Language Models (dLLMs) now match the downstream performance of their autoregressive counterparts on many tasks, while holding the promise of being more efficient during inference. One particularly successful variant is masked discrete diffusion,…

Local-Curvature-Aware Knowledge Graph Embedding: An Extended Ricci Flow Approach

arXiv:2512.07332v2 Announce Type: replace Abstract: Knowledge graph embedding (KGE) relies on the geometry of the embedding space to encode semantic and structural relations. Existing methods place all entities on one homogeneous manifold, Euclidean, spherical, hyperbolic, or their product/multi-curvature variants, to…

Spectral Embedding via Chebyshev Bases for Robust DeepONet Approximation

arXiv:2512.09165v1 Announce Type: new Abstract: Deep Operator Networks (DeepONets) have become a central tool in data-driven operator learning, providing flexible surrogates for nonlinear mappings arising in partial differential equations (PDEs). However, the standard trunk design based on fully connected layers…