Archives AI News

A Survey on Code Generation with LLM-based Agents

arXiv:2508.00083v2. Announce type: replace-cross. Abstract: Code generation agents powered by large language models (LLMs) are revolutionizing the software development paradigm. Distinct from previous code generation techniques, code generation agents are characterized by three core features. 1) Autonomy: the ability to…

Enhancing Linear Attention with Residual Learning

arXiv:2509.25223v1. Announce type: new. Abstract: Linear attention offers a linear-time alternative to self-attention but often struggles to capture long-range patterns. We revisit linear attention through a prediction-correction lens and show that prevalent variants can be written as a combination of…

AMLA: MUL by ADD in FlashAttention Rescaling

arXiv:2509.25224v1. Announce type: new. Abstract: Multi-head Latent Attention (MLA) significantly reduces KVCache memory usage in Large Language Models while introducing substantial computational overhead and intermediate variable expansion. This poses challenges for efficient hardware implementation, especially during the decode phase.…

GaussEdit: Adaptive 3D Scene Editing with Text and Image Prompts

arXiv:2509.26055v1. Announce type: cross. Abstract: This paper presents GaussEdit, a framework for adaptive 3D scene editing guided by text and image prompts. GaussEdit leverages 3D Gaussian Splatting as its backbone for scene representation, enabling convenient Region of Interest selection and…

Regression Language Models for Code

arXiv:2509.26476v1. Announce type: cross. Abstract: We study code-to-metric regression: predicting numeric outcomes of code executions, a challenging task due to the open-ended nature of programming languages. While prior methods have resorted to heavy and domain-specific feature engineering, we show that…