Archives AI News

Learning to Solve Complex Problems via Dataset Decomposition

arXiv:2602.20296v1 Announce Type: new Abstract: Curriculum learning is a class of training strategies that organizes the data being exposed to a model by difficulty, gradually from simpler to more complex examples. This research explores a reverse curriculum generation approach that…

Watermarking Degrades Alignment in Language Models: Analysis and Mitigation

arXiv:2506.04462v4 Announce Type: replace-cross Abstract: Watermarking has become a practical tool for tracing language model outputs, but it modifies token probabilities at inference time, which were carefully tuned by alignment training. This creates a tension: how do watermark-induced shifts interact…

Uncertainty Calibration of Multi-Label Bird Sound Classifiers

arXiv:2511.08261v2 Announce Type: replace-cross Abstract: Passive acoustic monitoring enables large-scale biodiversity assessment, but reliable classification of bioacoustic sounds requires not only high accuracy but also well-calibrated uncertainty estimates to ground decision-making. In bioacoustics, calibration is challenged by overlapping vocalisations, long-tailed…

In-context Pre-trained Time-Series Foundation Models adapt to Unseen Tasks

arXiv:2602.20307v1 Announce Type: new Abstract: Time-series foundation models (TSFMs) have demonstrated strong generalization capabilities across diverse datasets and tasks. However, existing foundation models are typically pre-trained to enhance performance on specific tasks and often struggle to generalize to unseen tasks…

Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks

arXiv:2602.20156v2 Announce Type: replace-cross Abstract: LLM agents are evolving rapidly, powered by code execution, tools, and the recently introduced agent skills feature. Skills allow users to extend LLM applications with specialized third-party code, knowledge, and instructions. Although this can extend…

CaDrift: A Time-dependent Causal Generator of Drifting Data Streams

arXiv:2602.20329v1 Announce Type: new Abstract: This work presents Causal Drift Generator (CaDrift), a time-dependent synthetic data generator framework based on Structural Causal Models (SCMs). The framework produces a virtually infinite combination of data streams with controlled shift events and time-dependent…

Towards Attributions of Input Variables in a Coalition

arXiv:2309.13411v3 Announce Type: replace Abstract: This paper focuses on the fundamental challenge of partitioning input variables in attribution methods for Explainable AI, particularly in Shapley value-based approaches. Previous methods always compute attributions given a predefined partition but lack theoretical guidance…