Archives AI News

VTPerception-R1: Enhancing Multimodal Reasoning via Explicit Visual and Textual Perceptual Grounding

arXiv:2509.24776v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) often struggle to ground reasoning in perceptual evidence. We present a systematic study of perception strategies-explicit, implicit, visual, and textual-across four multimodal benchmarks and two MLLMs. Our findings show that…

September 30, 2025

SysMoBench: Evaluating AI on Formally Modeling Complex Real-World Systems

arXiv:2509.23130v1 Announce Type: new Abstract: Formal models are essential to specifying large, complex computer systems and verifying their correctness, but are notoriously expensive to write and maintain. Recent advances in generative AI show promise in generating certain forms of specifications.…

September 30, 2025

Fast Real-Time Pipeline for Robust Arm Gesture Recognition

arXiv:2509.25042v1 Announce Type: cross Abstract: This paper presents a real-time pipeline for dynamic arm gesture recognition based on OpenPose keypoint estimation, keypoint normalization, and a recurrent neural network classifier. The 1 x 1 normalization scheme and two feature representations (coordinate-…

September 30, 2025

MathBode: Frequency-Domain Fingerprints of LLM Mathematical Reasoning

arXiv:2509.23143v1 Announce Type: new Abstract: This paper presents MathBode, a dynamic diagnostic for mathematical reasoning in large language models (LLMs). Instead of one-shot accuracy, MathBode treats each parametric problem as a system: we drive a single parameter sinusoidally and fit…

September 30, 2025

Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration

arXiv:2405.14314v4 Announce Type: replace Abstract: Grounding the reasoning ability of large language models (LLMs) for embodied tasks is challenging due to the complexity of the physical world. Especially, LLM planning for multi-agent collaboration requires communication of agents or credit assignment…

September 30, 2025

Coordination Requires Simplification: Thermodynamic Bounds on Multi-Objective Compromise in Natural and Artificial Intelligence

arXiv:2509.23144v1 Announce Type: new Abstract: Information-processing systems coordinating across multiple agents and objectives face fundamental thermodynamic constraints. We show that solutions with maximum utility to act as coordination focal points have much higher selection pressure for being findable across agents…

September 30, 2025

Prover Agent: An Agent-Based Framework for Formal Mathematical Proofs

arXiv:2506.19923v3 Announce Type: replace Abstract: We present Prover Agent, a novel AI agent for automated theorem proving that integrates large language models (LLMs) with a formal proof assistant, Lean. Prover Agent coordinates an informal reasoning LLM, a formal prover model,…

September 30, 2025

AI-Enhanced Distributed Channel Access for Collision Avoidance in Future Wi-Fi 8

arXiv:2509.23154v1 Announce Type: new Abstract: The exponential growth of wireless devices and stringent reliability requirements of emerging applications demand fundamental improvements in distributed channel access mechanisms for unlicensed bands. Current Wi-Fi systems, which rely on binary exponential backoff (BEB), suffer…

September 30, 2025

Ocassionally Secure: A Comparative Analysis of Code Generation Assistants

arXiv:2402.00689v2 Announce Type: replace-cross Abstract: $ $Large Language Models (LLMs) are being increasingly utilized in various applications, with code generations being a notable example. While previous research has shown that LLMs have the capability to generate both secure and insecure…

September 30, 2025

Limit Analysis for Symbolic Multi-step Reasoning Tasks with Information Propagation Rules Based on Transformers

arXiv:2509.23178v1 Announce Type: new Abstract: Transformers are able to perform reasoning tasks, however the intrinsic mechanism remains widely open. In this paper we propose a set of information propagation rules based on Transformers and utilize symbolic reasoning tasks to theoretically…

September 30, 2025