Archives AI News

Enabling Self-Improving Agents to Learn at Test Time With Human-In-The-Loop Guidance

arXiv:2507.17131v2 Announce Type: replace Abstract: Large language model (LLM) agents often struggle in environments where rules and required domain knowledge frequently change, such as regulatory compliance and user risk screening. Current approaches, like offline fine-tuning and standard prompting, are insufficient…

October 13, 2025

Don’t Waste Mistakes: Leveraging Negative RL-Groups via Confidence Reweighting

arXiv:2510.08696v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards (RLVR) has become a standard recipe for improving large language models (LLMs) on reasoning tasks, with Group Relative Policy Optimization (GRPO) widely used in practice. Yet GRPO wastes substantial compute…

October 13, 2025

DQS: A Low-Budget Query Strategy for Enhancing Unsupervised Data-driven Anomaly Detection Approaches

arXiv:2509.05663v2 Announce Type: replace Abstract: Truly unsupervised approaches for time series anomaly detection are rare in the literature. Those that exist suffer from a poorly set threshold, which hampers detection performance, while others, despite claiming to be unsupervised, need to…

October 13, 2025

In-Context Learning for Non-Stationary MIMO Equalization

arXiv:2510.08711v1 Announce Type: new Abstract: Channel equalization is fundamental for mitigating distortions such as frequency-selective fading and inter-symbol interference. Unlike standard supervised learning approaches that require costly retraining or fine-tuning for each new task, in-context learning (ICL) adapts to new…

October 13, 2025

Synthetic Series-Symbol Data Generation for Time Series Foundation Models

arXiv:2510.08445v2 Announce Type: replace Abstract: Foundation models for time series analysis (TSA) have attracted significant attention. However, challenges such as training data scarcity and imbalance continue to hinder their development. Inspired by complex dynamic system theories, we design a series-symbol…

October 13, 2025

Enhancing Self-Supervised Learning with Semantic Pairs: A New Dataset and Empirical Study

arXiv:2510.08722v1 Announce Type: new Abstract: Instance discrimination is a self-supervised representation learning paradigm wherein individual instances within a dataset are treated as distinct classes. This is typically achieved by generating two disparate views of each instance by applying stochastic transformations,…

October 13, 2025

Neural Beam Field for Spatial Beam RSRP Prediction

arXiv:2508.06956v2 Announce Type: replace-cross Abstract: Accurately predicting beam-level reference signal received power (RSRP) is essential for beam management in dense multi-user wireless networks, yet challenging due to high measurement overhead and fast channel variations. This paper proposes Neural Beam Field…

October 13, 2025

Counterfactually Fair Conformal Prediction

arXiv:2510.08724v1 Announce Type: new Abstract: While counterfactual fairness of point predictors is well studied, its extension to prediction sets, central to fair decision-making under uncertainty, remains underexplored. On the other hand, conformal prediction (CP) provides efficient, distribution-free, finite-sample valid prediction sets, yet…

October 13, 2025

A unified Bayesian framework for adversarial robustness

arXiv:2510.09288v1 Announce Type: cross Abstract: The vulnerability of machine learning models to adversarial attacks remains a critical security challenge. Traditional defenses, such as adversarial training, typically robustify models by minimizing a worst-case loss. However, these deterministic approaches do not account…

October 13, 2025

Transmuting prompts into weights

arXiv:2510.08734v1 Announce Type: new Abstract: A growing body of research has demonstrated that the behavior of large language models can be effectively controlled at inference time by directly modifying their internal states, either through vector additions to their activations or…

October 13, 2025