Archives AI News

ADARL: Adaptive Low-Rank Structures for Robust Policy Learning under Uncertainty

arXiv:2510.11899v1 Announce Type: new Abstract: Robust reinforcement learning (Robust RL) seeks to handle epistemic uncertainty in environment dynamics, but existing approaches often rely on nested min–max optimization, which is computationally expensive and yields overly conservative policies. We propose textbf{Adaptive Rank…

MooseAgent: A LLM Based Multi-agent Framework for Automating Moose Simulation

arXiv:2504.08621v2 Announce Type: replace Abstract: The Finite Element Method (FEM) is widely used in engineering and scientific computing, but its pre-processing, solver configuration, and post-processing stages are often time-consuming and require specialized knowledge. This paper proposes an automated solution framework,…

Attention as an Adaptive Filter

arXiv:2509.04154v3 Announce Type: replace Abstract: We introduce Adaptive Filter Attention (AFA), a novel attention mechanism that incorporates a learnable dynamics model directly into the computation of attention weights. Rather than comparing queries and keys directly, we model the input sequence…

ICL-Router: In-Context Learned Model Representations for LLM Routing

arXiv:2510.09719v2 Announce Type: replace Abstract: Large language models (LLMs) often exhibit complementary strengths. Model routing harnesses these strengths by dynamically directing each query to the most suitable model, given a candidate model pool. However, routing performance relies on accurate model…

Efficient Restarts in Non-Stationary Model-Free Reinforcement Learning

arXiv:2510.11933v1 Announce Type: new Abstract: In this work, we propose three efficient restart paradigms for model-free non-stationary reinforcement learning (RL). We identify two core issues with the restart design of Mao et al. (2022)’s RestartQ-UCB algorithm: (1) complete forgetting, where…

LLMBridge: Reducing Costs in a Prompt-Centric Internet

arXiv:2410.11857v2 Announce Type: replace-cross Abstract: Today’s Internet infrastructure is centered around content retrieval over HTTP, with middleboxes (e.g., HTTP proxies) playing a crucial role in performance, security, and cost-effectiveness. We envision a future where Internet communication will be dominated by…