Archives AI News

Positional Encoding via Token-Aware Phase Attention

arXiv:2509.12635v2 Announce Type: replace-cross Abstract: We prove under practical assumptions that Rotary Positional Embedding (RoPE) introduces an intrinsic distance-dependent bias in attention scores that limits RoPE’s ability to model long-context. RoPE extension methods may alleviate this issue, but they typically…

DS-STAR: Data Science Agent via Iterative Planning and Verification

arXiv:2509.21825v1 Announce Type: new Abstract: Data science, which transforms raw data into actionable insights, is critical for data-driven decision-making. However, these tasks are often complex, involving steps for exploring multiple data sources and synthesizing findings to deliver insightful answers. While…

Explaining multimodal LLMs via intra-modal token interactions

arXiv:2509.22415v1 Announce Type: cross Abstract: Multimodal Large Language Models (MLLMs) have achieved remarkable success across diverse vision-language tasks, yet their internal decision-making mechanisms remain insufficiently understood. Existing interpretability research has primarily focused on cross-modal attribution, identifying which image regions the…

Axiomatic Choice and the Decision-Evaluation Paradox

arXiv:2509.21836v1 Announce Type: new Abstract: We introduce a framework for modeling decisions with axioms that are statements about decisions, e.g., ethical constraints. Using our framework we define a taxonomy of decision axioms based on their structural properties and demonstrate a…

Reimagining Agent-based Modeling with Large Language Model Agents via Shachi

arXiv:2509.21862v1 Announce Type: new Abstract: The study of emergent behaviors in large language model (LLM)-driven multi-agent systems is a critical research challenge, yet progress is limited by a lack of principled methodologies for controlled experimentation. To address this, we introduce…

XBOUND: Exploring Capability Boundaries of Device-Control Agents at the State Level

arXiv:2505.21279v2 Announce Type: replace Abstract: Recent advancements in vision-language models have increased interest in Device-Control Agents (DC agents) for managing graphical user interfaces (GUIs). With the growing complexity and integration of such agents into various applications, effective evaluation methods have…