Archives AI News

Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks

arXiv:2602.20156v3 Announce Type: replace-cross Abstract: LLM agents are evolving rapidly, powered by code execution, tools, and the recently introduced agent skills feature. Skills allow users to extend LLM applications with specialized third-party code, knowledge, and instructions. Although this can extend…

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

arXiv:2602.21320v1 Announce Type: new Abstract: Large language models (LLMs) are becoming the foundation for autonomous agents that can use tools to solve complex tasks. Reinforcement learning (RL) has emerged as a common approach for injecting such agentic capabilities, but typically…

Parallel Split Learning with Global Sampling

arXiv:2407.15738v5 Announce Type: replace Abstract: Distributed deep learning in resource-constrained environments faces scalability and generalization challenges due to large effective batch sizes and non-identically distributed client data. We introduce a server-driven sampling strategy that maintains a fixed global batch size…

Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning

arXiv:2506.21427v3 Announce Type: replace Abstract: Generative models such as diffusion and flow-matching offer expressive policies for offline reinforcement learning (RL) by capturing rich, multimodal action distributions, but their iterative sampling introduces high inference costs and training instability due to gradient…

Equitable Evaluation via Elicitation

arXiv:2602.21327v1 Announce Type: new Abstract: Individuals with similar qualifications and skills may vary in their demeanor, or outward manner: some tend toward self-promotion while others are modest to the point of omitting crucial information. Comparing the self-descriptions of equally qualified…

Rethinking Consistent Multi-Label Classification Under Inexact Supervision

arXiv:2510.04091v2 Announce Type: replace Abstract: Partial multi-label learning and complementary multi-label learning are two popular weakly supervised multi-label classification paradigms that aim to alleviate the high annotation costs of collecting precisely annotated multi-label data. In partial multi-label learning, each instance…

Efficient Opportunistic Approachability

arXiv:2602.21328v1 Announce Type: new Abstract: We study the problem of opportunistic approachability: a generalization of Blackwell approachability where the learner would like to obtain stronger guarantees (i.e., approach a smaller set) when their adversary limits themselves to a subset of…