Archives AI News

GPO: Learning from Critical Steps to Improve LLM Reasoning

arXiv:2509.16456v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used in various domains, showing impressive potential on different tasks. Recently, reasoning LLMs have been proposed to improve the reasoning or thinking capabilities of LLMs to solve complex problems.…

September 23, 2025

Full-History Graphs with Edge-Type Decoupled Networks for Temporal Reasoning

arXiv:2508.03251v2 Announce Type: replace Abstract: Modeling evolving interactions among entities is critical in many real-world tasks. For example, predicting driver maneuvers in traffic requires tracking how neighboring vehicles accelerate, brake, and change lanes relative to one another over consecutive frames.…

September 23, 2025

Checking extracted rules in Neural Networks

arXiv:2509.16547v1 Announce Type: new Abstract: In this paper we investigate formal verification of extracted rules for Neural Networks under a complexity theoretic point of view. A rule is a global property or a pattern concerning a large portion of the…

September 23, 2025

Bayesian scaling laws for in-context learning

arXiv:2410.16531v4 Announce Type: replace-cross Abstract: In-context learning (ICL) is a powerful technique for getting language models to perform complex tasks with no training updates. Prior work has established strong correlations between the number of in-context examples provided and the accuracy…

September 23, 2025

SalaMAnder: Shapley-based Mathematical Expression Attribution and Metric for Chain-of-Thought Reasoning

arXiv:2509.16561v1 Announce Type: new Abstract: Chain-of-Thought (CoT) prompting enhances the math reasoning capability of large language models (LLMs) to a large margin. However, the mechanism underlying such improvements remains unexplored. In this paper, we present SalaMAnder (Shapley-based Mathematical Expression Attribution…

September 23, 2025

Can Language Models Follow Multiple Turns of Entangled Instructions?

arXiv:2503.13222v3 Announce Type: replace-cross Abstract: Despite significant achievements in improving the instruction-following capabilities of large language models (LLMs), the ability to process multiple potentially entangled or conflicting instructions remains a considerable challenge. Real-world scenarios often require consistency across multiple instructions…

September 23, 2025

Zero-Shot Human Mobility Forecasting via Large Language Model with Hierarchical Reasoning

arXiv:2509.16578v1 Announce Type: new Abstract: Human mobility forecasting is important for applications such as transportation planning, urban management, and personalized recommendations. However, existing methods often fail to generalize to unseen users or locations and struggle to capture dynamic intent due…

September 23, 2025

Runaway is Ashamed, But Helpful: On the Early-Exit Behavior of Large Language Model-based Agents in Embodied Environments

arXiv:2505.17616v2 Announce Type: replace-cross Abstract: Agents powered by large language models (LLMs) have demonstrated strong planning and decision-making capabilities in complex embodied environments. However, such agents often suffer from inefficiencies in multi-turn interactions, frequently trapped in repetitive loops or issuing…

September 23, 2025

Question Answering with LLMs and Learning from Answer Sets

arXiv:2509.16590v1 Announce Type: new Abstract: Large Language Models (LLMs) excel at understanding natural language but struggle with explicit commonsense reasoning. A recent trend of research suggests that the combination of LLM with robust symbolic reasoning systems can overcome this problem…

September 23, 2025

Enhancing Live Broadcast Engagement: A Multi-modal Approach to Short Video Recommendations Using MMGCN and User Preferences

arXiv:2506.23085v2 Announce Type: replace-cross Abstract: The purpose of this paper is to explore a multi-modal approach to enhancing live broadcast engagement by developing a short video recommendation system that incorporates Multi-modal Graph Convolutional Networks (MMGCN) with user preferences. To provide…

September 23, 2025