Archives AI News

A Survey on Code Generation with LLM-based Agents

arXiv:2508.00083v2 Announce Type: replace-cross Abstract: Code generation agents powered by large language models (LLMs) are revolutionizing the software development paradigm. Distinct from previous code generation techniques, code generation agents are characterized by three core features. 1) Autonomy: the ability to…

October 1, 2025

Enhancing Linear Attention with Residual Learning

arXiv:2509.25223v1 Announce Type: new Abstract: Linear attention offers a linear-time alternative to self-attention but often struggles to capture long-range patterns. We revisit linear attention through a prediction-correction lens and show that prevalent variants can be written as a combination of…

October 1, 2025

SAC Flow: Sample-Efficient Reinforcement Learning of Flow-Based Policies via Velocity-Reparameterized Sequential Modeling

arXiv:2509.25756v1 Announce Type: cross Abstract: Training expressive flow-based policies with off-policy reinforcement learning is notoriously unstable due to gradient pathologies in the multi-step action sampling process. We trace this instability to a fundamental connection: the flow rollout is algebraically equivalent…

October 1, 2025

AMLA: MUL by ADD in FlashAttention Rescaling

arXiv:2509.25224v1 Announce Type: new Abstract: Multi-head Latent Attention (MLA) significantly reduces KVCache memory usage in Large Language Models while introducing substantial computational overhead and intermediate variable expansion. This poses challenges for efficient hardware implementation, especially during the decode phase.…

October 1, 2025

GaussEdit: Adaptive 3D Scene Editing with Text and Image Prompts

arXiv:2509.26055v1 Announce Type: cross Abstract: This paper presents GaussEdit, a framework for adaptive 3D scene editing guided by text and image prompts. GaussEdit leverages 3D Gaussian Splatting as its backbone for scene representation, enabling convenient Region of Interest selection and…

October 1, 2025

MSCoD: An Enhanced Bayesian Updating Framework with Multi-Scale Information Bottleneck and Cooperative Attention for Structure-Based Drug Design

arXiv:2509.25225v1 Announce Type: new Abstract: Structure-Based Drug Design (SBDD) is a powerful strategy in computational drug discovery, utilizing three-dimensional protein structures to guide the design of molecules with improved binding affinity. However, capturing complex protein-ligand interactions across multiple scales remains…

October 1, 2025

PRPO: Paragraph-level Policy Optimization for Vision-Language Deepfake Detection

arXiv:2509.26272v1 Announce Type: cross Abstract: The rapid rise of synthetic media has made deepfake detection a critical challenge for online safety and trust. Progress remains constrained by the scarcity of large, high-quality datasets. Although multimodal large language models (LLMs) exhibit…

October 1, 2025

Integrated Forecasting of Marine Renewable Power: An Adaptively Bayesian-Optimized MVMD-LSTM Framework for Wind-Solar-Wave Energy

arXiv:2509.25226v1 Announce Type: new Abstract: Integrated wind-solar-wave marine energy systems hold broad promise for supplying clean electricity in offshore and coastal regions. By leveraging the spatiotemporal complementarity of multiple resources, such systems can effectively mitigate the intermittency and volatility of…

October 1, 2025

Regression Language Models for Code

arXiv:2509.26476v1 Announce Type: cross Abstract: We study code-to-metric regression: predicting numeric outcomes of code executions, a challenging task due to the open-ended nature of programming languages. While prior methods have resorted to heavy and domain-specific feature engineering, we show that…

October 1, 2025

Simple, Fast and Efficient Injective Manifold Density Estimation with Random Projections

arXiv:2509.25228v1 Announce Type: new Abstract: We introduce Random Projection Flows (RPFs), a principled framework for injective normalizing flows that leverages tools from random matrix theory and the geometry of random projections. RPFs employ random semi-orthogonal matrices, drawn from Haar-distributed orthogonal…

October 1, 2025