Archives AI News

Hierarchical Reasoning Model: A Critical Supplementary Material

arXiv:2510.00355v1 Announce Type: new Abstract: Transformers have demonstrated remarkable performance in natural language processing and related domains, as they largely focus on sequential, autoregressive next-token prediction tasks. Yet, they struggle in logical reasoning, not necessarily because of a fundamental limitation…

Rethinking Thinking Tokens: LLMs as Improvement Operators

arXiv:2510.01123v1 Announce Type: cross Abstract: Reasoning training incentivizes LLMs to produce long chains of thought (long CoT), which among other things, allows them to explore solution strategies with self-checking. This results in higher accuracy, but inflates context length, token/compute cost,…

Semantic-Driven AI Agent Communications: Challenges and Solutions

arXiv:2510.00381v1 Announce Type: new Abstract: With the rapid growth of intelligent services, communication targets are shifting from humans to artificial intelligent (AI) agents, which require new paradigms to enable real-time perception, decision-making, and collaboration. Semantic communication, which conveys task-relevant meaning…

Balancing Multimodal Training Through Game-Theoretic Regularization

arXiv:2411.07335v3 Announce Type: replace-cross Abstract: Multimodal learning holds promise for richer information extraction by capturing dependencies across data sources. Yet, current training methods often underperform due to modality competition, a phenomenon where modalities contend for training resources leaving some underoptimized.…

Rethinking Reward Models for Multi-Domain Test-Time Scaling

arXiv:2510.00492v1 Announce Type: new Abstract: The reliability of large language models (LLMs) during test-time scaling is often assessed with emph{external verifiers} or emph{reward models} that distinguish correct reasoning from flawed logic. Prior work generally assumes that process reward models (PRMs),…