Archives AI News

From Solving to Verifying: A Unified Objective for Robust Reasoning in LLMs

arXiv:2511.15137v1 Announce Type: new Abstract: The reasoning capabilities of large language models (LLMs) have been significantly improved through reinforcement learning (RL). Nevertheless, LLMs still struggle to consistently verify their own reasoning traces. This raises the research question of how to…

Cross-Modal Consistency-Guided Active Learning for Affective BCI Systems

arXiv:2511.15138v1 Announce Type: new Abstract: Deep learning models perform best with abundant, high-quality labels, yet such conditions are rarely achievable in EEG-based emotion recognition. Electroencephalogram (EEG) signals are easily corrupted by artifacts and individual variability, while emotional labels often stem…

VisPlay: Self-Evolving Vision-Language Models from Images

arXiv:2511.15661v1 Announce Type: cross Abstract: Reinforcement learning (RL) provides a principled framework for improving Vision-Language Models (VLMs) on complex reasoning tasks. However, existing RL approaches often rely on human-annotated labels or task-specific heuristics to define verifiable rewards, both of which…

$pi^{*}_{0.6}$: a VLA That Learns From Experience

arXiv:2511.14759v2 Announce Type: replace Abstract: We study how vision-language-action (VLA) models can improve through real-world deployments via reinforcement learning (RL). We present a general-purpose method, RL with Experience and Corrections via Advantage-conditioned Policies (RECAP), that provides for RL training of…