Archives AI News

Empowering Multi-Turn Tool-Integrated Reasoning with Group Turn Policy Optimization

arXiv:2511.14846v1 Announce Type: new Abstract: Training Large Language Models (LLMs) for multi-turn Tool-Integrated Reasoning (TIR) – where models iteratively reason, generate code, and verify through execution – remains challenging for existing reinforcement learning (RL) approaches. Current RL methods, exemplified by…

Explaining Time Series Classification Predictions via Causal Attributions

arXiv:2405.15871v2 Announce Type: replace Abstract: Despite the excelling performance of machine learning models, understanding their decisions remains a long-standing goal. Although commonly used attribution methods from explainable AI attempt to address this issue, they typically rely on associational rather than…

Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion

arXiv:2503.10488v3 Announce Type: replace Abstract: Generating co-speech gestures in real time requires both temporal coherence and efficient sampling. We introduce a novel framework for streaming gesture generation that extends Rolling Diffusion models with structured progressive noise scheduling, enabling seamless long-sequence…

Coresets from Trajectories: Selecting Data via Correlation of Loss Differences

arXiv:2508.20230v2 Announce Type: replace Abstract: Deep learning models achieve state-of-the-art performance across domains but face scalability challenges in real-time or resource-constrained scenarios. To address this, we propose Correlation of Loss Differences (CLD), a simple and scalable metric for coreset selection…

VisPlay: Self-Evolving Vision-Language Models from Images

arXiv:2511.15661v1 Announce Type: cross Abstract: Reinforcement learning (RL) provides a principled framework for improving Vision-Language Models (VLMs) on complex reasoning tasks. However, existing RL approaches often rely on human-annotated labels or task-specific heuristics to define verifiable rewards, both of which…