Archives AI News

SABER: Small Actions, Big Errors – Safeguarding Mutating Steps in LLM Agents

arXiv:2512.07850v1 Announce Type: new Abstract: Despite rapid progress in LLM agents, performance on long-horizon, tool-using tasks remains fragile. To better understand this fragility, we ask a simple question: emph{do all actions contribute equally to failure?} Analyzing execution traces on $tau$-Bench…

Multicalibration for LLM-based Code Generation

arXiv:2512.08810v1 Announce Type: cross Abstract: As AI-based code generation becomes widespread, researchers are investigating the calibration of code LLMs – ensuring their confidence scores faithfully represent the true likelihood of code correctness. To do so, we investigate multicalibration, which can…

FAIM: Frequency-Aware Interactive Mamba for Time Series Classification

arXiv:2512.07858v1 Announce Type: new Abstract: Time series classification (TSC) is crucial in numerous real-world applications, such as environmental monitoring, medical diagnosis, and posture recognition. TSC tasks require models to effectively capture discriminative information for accurate class identification. Although deep learning…

Score-based Conditional Out-of-Distribution Augmentation for Graph Covariate Shift

arXiv:2410.17506v2 Announce Type: replace Abstract: Distribution shifts between training and testing datasets significantly impair the model performance on graph learning. A commonly-taken causal view in graph invariant learning suggests that stable predictive features of graphs are causally associated with labels,…

SetAD: Semi-Supervised Anomaly Learning in Contextual Sets

arXiv:2512.07863v1 Announce Type: new Abstract: Semi-supervised anomaly detection (AD) has shown great promise by effectively leveraging limited labeled data. However, existing methods are typically structured around scoring individual points or simple pairs. Such {point- or pair-centric} view not only overlooks…