Archives AI News

LLM Meeting Decision Trees on Tabular Data

arXiv:2505.17918v2 Announce Type: replace Abstract: Tabular data have been playing a vital role in diverse real-world fields, including healthcare, finance, etc. With the recent success of Large Language Models (LLMs), early explorations of extending LLMs to the domain of tabular…

December 11, 2025

Beyond the Hype: Comparing Lightweight and Deep Learning Models for Air Quality Forecasting

arXiv:2512.09076v1 Announce Type: new Abstract: Accurate forecasting of urban air pollution is essential for protecting public health and guiding mitigation policies. While Deep Learning (DL) and hybrid pipelines dominate recent research, their complexity and limited interpretability hinder operational use. This…

December 11, 2025

GS-KAN: Parameter-Efficient Kolmogorov-Arnold Networks via Sprecher-Type Shared Basis Functions

arXiv:2512.09084v1 Announce Type: new Abstract: The Kolmogorov-Arnold representation theorem offers a theoretical alternative to Multi-Layer Perceptrons (MLPs) by placing learnable univariate functions on edges rather than nodes. While recent implementations such as Kolmogorov-Arnold Networks (KANs) demonstrate high approximation capabilities, they…

December 11, 2025

Learning What Matters: Steering Diffusion via Spectrally Anisotropic Forward Noise

arXiv:2510.09660v4 Announce Type: replace Abstract: Diffusion Probabilistic Models (DPMs) have achieved strong generative performance, yet their inductive biases remain largely implicit. In this work, we aim to build inductive biases into the training and sampling of diffusion models to better…

December 11, 2025

Natural Geometry of Robust Data Attribution: From Convex Models to Deep Networks

arXiv:2512.09103v1 Announce Type: new Abstract: Data attribution methods identify which training examples are responsible for a model’s predictions, but their sensitivity to distributional perturbations undermines practical reliability. We present a unified framework for certified robust attribution that extends from convex…

December 11, 2025

MAESTRO: Multi-Agent Environment Shaping through Task and Reward Optimization

arXiv:2511.19253v2 Announce Type: replace Abstract: Cooperative Multi-Agent Reinforcement Learning (MARL) faces two major design bottlenecks: crafting dense reward functions and constructing curricula that avoid local optima in high-dimensional, non-stationary environments. Existing approaches rely on fixed heuristics or use Large Language…

December 11, 2025

Learning Unmasking Policies for Diffusion Language Models

arXiv:2512.09106v1 Announce Type: new Abstract: Diffusion (Large) Language Models (dLLMs) now match the downstream performance of their autoregressive counterparts on many tasks, while holding the promise of being more efficient during inference. One particularly successful variant is masked discrete diffusion,…

December 11, 2025

New materials could boost the energy efficiency of microelectronics

By stacking multiple active components based on new materials on the back end of a computer chip, this new approach reduces the amount of energy wasted during computation.

December 11, 2025

RIFT: A Scalable Methodology for LLM Accelerator Fault Assessment using Reinforcement Learning

arXiv:2512.09829v1 Announce Type: cross Abstract: The massive scale of modern AI accelerators presents critical challenges to traditional fault assessment methodologies, which face prohibitive computational costs and provide poor coverage of critical failure modes. This paper introduces RIFT (Reinforcement Learning-guided Intelligent…

December 11, 2025

Neural Diversity Regularizes Hallucinations in Language Models

arXiv:2510.20690v2 Announce Type: replace-cross Abstract: Language models continue to hallucinate despite increases in parameters, compute, and data. We propose neural diversity — decorrelated parallel representations — as a principled mechanism that reduces hallucination rates at fixed parameter and data budgets.…

December 11, 2025