Archives AI News

SIFThinker: Spatially-Aware Image Focus for Visual Reasoning

arXiv:2508.06259v4 Announce Type: replace-cross Abstract: Current multimodal large language models (MLLMs) still face significant challenges in complex visual tasks (e.g., spatial understanding, fine-grained perception). Prior methods have tried to incorporate visual reasoning, however, they fail to leverage attention correction with…

TokenSkip: Controllable Chain-of-Thought Compression in LLMs

arXiv:2502.12067v3 Announce Type: replace-cross Abstract: Chain-of-Thought (CoT) has been proven effective in enhancing the reasoning capabilities of large language models (LLMs). Recent advancements, such as OpenAI’s o1 and DeepSeek-R1, suggest that scaling up the length of CoT sequences during inference…

Rich Vehicle Routing Problem with diverse Vertices allowing Hierarchical and Multimodal Time-Dependant Transhipment of multiple Node- Vehicle- compatible Cargo with Cascaded Time-Minimization Objective for Emergency Decision Support Systems

arXiv:2509.13227v1 Announce Type: cross Abstract: A rich vehicle routing problem is considered allowing multiple trips of heterogeneous vehicles stationed at distributed vehicle depots spread across diverse geographies having access to different modes of transportation. The problem arises from the real…

Building Coding Agents via Entropy-Enhanced Multi-Turn Preference Optimization

arXiv:2509.12434v1 Announce Type: new Abstract: Software engineering presents complex, multi-step challenges for Large Language Models (LLMs), requiring reasoning over large codebases and coordinated tool use. The difficulty of these tasks is exemplified by benchmarks like SWE-bench, where current LLMs still…