Archives AI News

SysMoBench: Evaluating AI on Formally Modeling Complex Real-World Systems

arXiv:2509.23130v1 Announce Type: new Abstract: Formal models are essential to specifying large, complex computer systems and verifying their correctness, but are notoriously expensive to write and maintain. Recent advances in generative AI show promise in generating certain forms of specifications.…

Fast Real-Time Pipeline for Robust Arm Gesture Recognition

arXiv:2509.25042v1 Announce Type: cross Abstract: This paper presents a real-time pipeline for dynamic arm gesture recognition based on OpenPose keypoint estimation, keypoint normalization, and a recurrent neural network classifier. The 1 x 1 normalization scheme and two feature representations (coordinate-…

MathBode: Frequency-Domain Fingerprints of LLM Mathematical Reasoning

arXiv:2509.23143v1 Announce Type: new Abstract: This paper presents MathBode, a dynamic diagnostic for mathematical reasoning in large language models (LLMs). Instead of one-shot accuracy, MathBode treats each parametric problem as a system: we drive a single parameter sinusoidally and fit…

Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration

arXiv:2405.14314v4 Announce Type: replace Abstract: Grounding the reasoning ability of large language models (LLMs) for embodied tasks is challenging due to the complexity of the physical world. Especially, LLM planning for multi-agent collaboration requires communication of agents or credit assignment…

Prover Agent: An Agent-Based Framework for Formal Mathematical Proofs

arXiv:2506.19923v3 Announce Type: replace Abstract: We present Prover Agent, a novel AI agent for automated theorem proving that integrates large language models (LLMs) with a formal proof assistant, Lean. Prover Agent coordinates an informal reasoning LLM, a formal prover model,…

AI-Enhanced Distributed Channel Access for Collision Avoidance in Future Wi-Fi 8

arXiv:2509.23154v1 Announce Type: new Abstract: The exponential growth of wireless devices and stringent reliability requirements of emerging applications demand fundamental improvements in distributed channel access mechanisms for unlicensed bands. Current Wi-Fi systems, which rely on binary exponential backoff (BEB), suffer…

Ocassionally Secure: A Comparative Analysis of Code Generation Assistants

arXiv:2402.00689v2 Announce Type: replace-cross Abstract: $ $Large Language Models (LLMs) are being increasingly utilized in various applications, with code generations being a notable example. While previous research has shown that LLMs have the capability to generate both secure and insecure…