Archives AI News

CoPRIS: Efficient and Stable Reinforcement Learning via Concurrency-Controlled Partial Rollout with Importance Sampling

arXiv:2511.05589v1 Announce Type: new Abstract: Reinforcement learning (RL) post-training has become a trending paradigm for enhancing the capabilities of large language models (LLMs). Most existing RL systems for LLMs operate in a fully synchronous manner, where training must wait for…

November 11, 2025

From Invariant Representations to Invariant Data: Provable Robustness to Spurious Correlations via Noisy Counterfactual Matching

arXiv:2505.24843v2 Announce Type: replace Abstract: Models that learn spurious correlations from training data often fail when deployed in new environments. While many methods aim to learn invariant representations to address this, they often underperform standard empirical risk minimization (ERM). We…

November 11, 2025

FedSparQ: Adaptive Sparse Quantization with Error Feedback for Robust & Efficient Federated Learning

arXiv:2511.05591v1 Announce Type: new Abstract: Federated Learning (FL) enables collaborative model training across decentralized clients while preserving data privacy by keeping raw data local. However, FL suffers from significant communication overhead due to the frequent exchange of high-dimensional model updates…

November 11, 2025

An upper bound of the silhouette validation metric for clustering

arXiv:2509.08625v2 Announce Type: replace Abstract: The silhouette coefficient quantifies, for each observation, the balance between within-cluster cohesion and between-cluster separation, taking values in [-1, 1]. The average silhouette width (ASW) is a widely used internal measure of clustering quality, with…

November 11, 2025

GRAVER: Generative Graph Vocabularies for Robust Graph Foundation Models Fine-tuning

arXiv:2511.05592v1 Announce Type: new Abstract: Inspired by the remarkable success of foundation models in language and vision, Graph Foundation Models (GFMs) hold significant promise for broad applicability across diverse graph tasks and domains. However, existing GFMs struggle with unstable few-shot…

November 11, 2025

A Feedback-Control Framework for Efficient Dataset Collection from In-Vehicle Data Streams

arXiv:2511.03239v2 Announce Type: replace Abstract: Modern AI systems are increasingly constrained not by model capacity but by the quality and diversity of their data. Despite growing emphasis on data-centric AI, most datasets are still gathered in an open-loop manner which…

November 11, 2025

Gradient Projection onto Historical Descent Directions for Communication-Efficient Federated Learning

arXiv:2511.05593v1 Announce Type: new Abstract: Federated Learning (FL) enables decentralized model training across multiple clients while optionally preserving data privacy. However, communication efficiency remains a critical bottleneck, particularly for large-scale models. In this work, we introduce two complementary algorithms: ProjFL,…

November 11, 2025

Learning Task Representations from In-Context Learning

arXiv:2502.05390v2 Announce Type: replace-cross Abstract: Large language models (LLMs) have demonstrated remarkable proficiency in in-context learning (ICL), where models adapt to new tasks through example-based prompts without requiring parameter updates. However, understanding how tasks are internally encoded and generalized remains…

November 11, 2025

Optimizing Predictive Maintenance in Intelligent Manufacturing: An Integrated FNO-DAE-GNN-PPO MDP Framework

arXiv:2511.05594v1 Announce Type: new Abstract: In the era of smart manufacturing, predictive maintenance (PdM) plays a pivotal role in improving equipment reliability and reducing operating costs. In this paper, we propose a novel Markov Decision Process (MDP) framework that integrates…

November 11, 2025

CultureGuard: Towards Culturally-Aware Dataset and Guard Model for Multilingual Safety Applications

arXiv:2508.01710v4 Announce Type: replace-cross Abstract: The increasing use of Large Language Models (LLMs) in agentic applications highlights the need for robust safety guard models. While content safety in English is well-studied, non-English languages lack similar advancements due to the high…

November 11, 2025