Archives AI News

LLM Meeting Decision Trees on Tabular Data

arXiv:2505.17918v2 Announce Type: replace Abstract: Tabular data have been playing a vital role in diverse real-world fields, including healthcare, finance, etc. With the recent success of Large Language Models (LLMs), early explorations of extending LLMs to the domain of tabular…

December 11, 2025

Beyond the Hype: Comparing Lightweight and Deep Learning Models for Air Quality Forecasting

arXiv:2512.09076v1 Announce Type: new Abstract: Accurate forecasting of urban air pollution is essential for protecting public health and guiding mitigation policies. While Deep Learning (DL) and hybrid pipelines dominate recent research, their complexity and limited interpretability hinder operational use. This…

December 11, 2025

GS-KAN: Parameter-Efficient Kolmogorov-Arnold Networks via Sprecher-Type Shared Basis Functions

arXiv:2512.09084v1 Announce Type: new Abstract: The Kolmogorov-Arnold representation theorem offers a theoretical alternative to Multi-Layer Perceptrons (MLPs) by placing learnable univariate functions on edges rather than nodes. While recent implementations such as Kolmogorov-Arnold Networks (KANs) demonstrate high approximation capabilities, they…

December 11, 2025

Learning What Matters: Steering Diffusion via Spectrally Anisotropic Forward Noise

arXiv:2510.09660v4 Announce Type: replace Abstract: Diffusion Probabilistic Models (DPMs) have achieved strong generative performance, yet their inductive biases remain largely implicit. In this work, we aim to build inductive biases into the training and sampling of diffusion models to better…

December 11, 2025

Natural Geometry of Robust Data Attribution: From Convex Models to Deep Networks

arXiv:2512.09103v1 Announce Type: new Abstract: Data attribution methods identify which training examples are responsible for a model’s predictions, but their sensitivity to distributional perturbations undermines practical reliability. We present a unified framework for certified robust attribution that extends from convex…

December 11, 2025

MAESTRO: Multi-Agent Environment Shaping through Task and Reward Optimization

arXiv:2511.19253v2 Announce Type: replace Abstract: Cooperative Multi-Agent Reinforcement Learning (MARL) faces two major design bottlenecks: crafting dense reward functions and constructing curricula that avoid local optima in high-dimensional, non-stationary environments. Existing approaches rely on fixed heuristics or use Large Language…

December 11, 2025

Learning Unmasking Policies for Diffusion Language Models

arXiv:2512.09106v1 Announce Type: new Abstract: Diffusion (Large) Language Models (dLLMs) now match the downstream performance of their autoregressive counterparts on many tasks, while holding the promise of being more efficient during inference. One particularly successful variant is masked discrete diffusion,…

December 11, 2025

Local-Curvature-Aware Knowledge Graph Embedding: An Extended Ricci Flow Approach

arXiv:2512.07332v2 Announce Type: replace Abstract: Knowledge graph embedding (KGE) relies on the geometry of the embedding space to encode semantic and structural relations. Existing methods place all entities on one homogeneous manifold, Euclidean, spherical, hyperbolic, or their product/multi-curvature variants, to…

December 11, 2025

Spectral Embedding via Chebyshev Bases for Robust DeepONet Approximation

arXiv:2512.09165v1 Announce Type: new Abstract: Deep Operator Networks (DeepONets) have become a central tool in data-driven operator learning, providing flexible surrogates for nonlinear mappings arising in partial differential equations (PDEs). However, the standard trunk design based on fully connected layers…

December 11, 2025

A Multivariate Bernoulli-Based Sampling Method for Multi-Label Data with Application to Meta-Research

arXiv:2512.08371v2 Announce Type: replace Abstract: Datasets may contain observations with multiple labels. If the labels are not mutually exclusive, and if the labels vary greatly in frequency, obtaining a sample that includes sufficient observations with scarcer labels to make inferences…

December 11, 2025