Archives: AI News

Structural Reward Model: Enhancing Interpretability, Efficiency, and Scalability in Reward Modeling

arXiv:2509.25361v1. Announce Type: new. Abstract: Reward Models (RMs) are key components for evaluating and guiding language model outputs. However, traditional scalar RMs often struggle with incorporating contextual and background information during inference, leading to incomplete evaluations. Generative RMs (GRMs) attempt…

October 1, 2025

Multi-Robot Task Planning for Multi-Object Retrieval Tasks with Distributed On-Site Knowledge via Large Language Models

arXiv:2509.12838v2. Announce Type: replace-cross. Abstract: It is crucial to efficiently execute instructions such as “Find an apple and a banana” or “Get ready for a field trip,” which require searching for multiple objects or understanding context-dependent commands. This study addresses…

October 1, 2025

Where LLM Agents Fail and How They can Learn From Failures

arXiv:2509.25370v1. Announce Type: new. Abstract: Large Language Model (LLM) agents, which integrate planning, memory, reflection, and tool-use modules, have shown promise in solving complex, multi-step tasks. Yet their sophisticated architectures amplify vulnerability to cascading failures, where a single root-cause error…

October 1, 2025

Data-Free Continual Learning of Server Models in Model-Heterogeneous Federated learning

arXiv:2509.25977v1. Announce Type: cross. Abstract: Federated learning (FL) is a distributed learning paradigm across multiple entities while preserving data privacy. However, with the continuous emergence of new data and increasing model diversity, traditional federated learning faces significant challenges, including inherent…

October 1, 2025

From Perception to Cognition: A Survey of Vision-Language Interactive Reasoning in Multimodal Large Language Models

arXiv:2509.25373v1. Announce Type: new. Abstract: Multimodal Large Language Models (MLLMs) strive to achieve a profound, human-like understanding of and interaction with the physical world, but often exhibit a shallow and incoherent integration when acquiring information (Perception) and conducting reasoning (Cognition).…

October 1, 2025

Leveraging AI modelling for FDS with Simvue: monitor and optimise for more sustainable simulations

arXiv:2509.26139v1. Announce Type: cross. Abstract: There is high demand on fire simulations, in both scale and quantity. We present a multi-pronged approach to improving the time and energy required to meet these demands. We show the ability of a custom…

October 1, 2025

Saliency Guided Longitudinal Medical Visual Question Answering

arXiv:2509.25374v1. Announce Type: new. Abstract: Longitudinal medical visual question answering (Diff-VQA) requires comparing paired studies from different time points and answering questions about clinically meaningful changes. In this setting, the difference signal and the consistency of visual focus across time…

October 1, 2025

Point2RBox-v3: Self-Bootstrapping from Point Annotations via Integrated Pseudo-Label Refinement and Utilization

arXiv:2509.26281v1. Announce Type: cross. Abstract: Driven by the growing need for Oriented Object Detection (OOD), learning from point annotations under a weakly-supervised framework has emerged as a promising alternative to costly and laborious manual labeling. In this paper, we discuss…

October 1, 2025

Boolean Satisfiability via Imitation Learning

arXiv:2509.25411v1. Announce Type: new. Abstract: We propose ImitSAT, a branching policy for conflict-driven clause learning (CDCL) solvers based on imitation learning for the Boolean satisfiability problem (SAT). Unlike previous methods that predict instance-level signals to improve CDCL branching indirectly, or…

October 1, 2025

ACT: Agentic Classification Tree

arXiv:2509.26433v1. Announce Type: cross. Abstract: When used in high-stakes settings, AI systems are expected to produce decisions that are transparent, interpretable, and auditable, a requirement increasingly expected by regulations. Decision trees such as CART provide clear and verifiable rules, but…

October 1, 2025