Archives AI News

Where LLM Agents Fail and How They can Learn From Failures

arXiv:2509.25370v1 Announce Type: new Abstract: Large Language Model (LLM) agents, which integrate planning, memory, reflection, and tool-use modules, have shown promise in solving complex, multi-step tasks. Yet their sophisticated architectures amplify vulnerability to cascading failures, where a single root-cause error…

Saliency Guided Longitudinal Medical Visual Question Answering

arXiv:2509.25374v1 Announce Type: new Abstract: Longitudinal medical visual question answering (Diff-VQA) requires comparing paired studies from different time points and answering questions about clinically meaningful changes. In this setting, the difference signal and the consistency of visual focus across time…

Boolean Satisfiability via Imitation Learning

arXiv:2509.25411v1 Announce Type: new Abstract: We propose ImitSAT, a branching policy for conflict-driven clause learning (CDCL) solvers based on imitation learning for the Boolean satisfiability problem (SAT). Unlike previous methods that predict instance-level signals to improve CDCL branching indirectly, or…

ACT: Agentic Classification Tree

arXiv:2509.26433v1 Announce Type: cross Abstract: When used in high-stakes settings, AI systems are expected to produce decisions that are transparent, interpretable, and auditable, a requirement increasingly expected by regulations. Decision trees such as CART provide clear and verifiable rules, but…