Archives AI News

Attention as an Adaptive Filter

arXiv:2509.04154v3 Announce Type: replace Abstract: We introduce Adaptive Filter Attention (AFA), a novel attention mechanism that incorporates a learnable dynamics model directly into the computation of attention weights. Rather than comparing queries and keys directly, we model the input sequence…

ICL-Router: In-Context Learned Model Representations for LLM Routing

arXiv:2510.09719v2 Announce Type: replace Abstract: Large language models (LLMs) often exhibit complementary strengths. Model routing harnesses these strengths by dynamically directing each query to the most suitable model, given a candidate model pool. However, routing performance relies on accurate model…

Efficient Restarts in Non-Stationary Model-Free Reinforcement Learning

arXiv:2510.11933v1 Announce Type: new Abstract: In this work, we propose three efficient restart paradigms for model-free non-stationary reinforcement learning (RL). We identify two core issues with the restart design of Mao et al. (2022)’s RestartQ-UCB algorithm: (1) complete forgetting, where…

LLMBridge: Reducing Costs in a Prompt-Centric Internet

arXiv:2410.11857v2 Announce Type: replace-cross Abstract: Today’s Internet infrastructure is centered around content retrieval over HTTP, with middleboxes (e.g., HTTP proxies) playing a crucial role in performance, security, and cost-effectiveness. We envision a future where Internet communication will be dominated by…

On efficiently computable functions, deep networks and sparse compositionality

arXiv:2510.11942v1 Announce Type: new Abstract: We show that emph{efficient Turing computability} at any fixed input/output precision implies the existence of emph{compositionally sparse} (bounded-fan-in, polynomial-size) DAG representations and of corresponding neural approximants achieving the target precision. Concretely: if $f:[0,1]^dtoR^m$ is computable…

Inverse Design in Nanophotonics via Representation Learning

arXiv:2507.00546v2 Announce Type: replace-cross Abstract: Inverse design in nanophotonics, the computational discovery of structures achieving targeted electromagnetic (EM) responses, has become a key tool for recent optical advances. Traditional intuition-driven or iterative optimization methods struggle with the inherently high-dimensional, non-convex…