Archives AI News

Reinforcement Learning with Function Approximation for Non-Markov Processes

arXiv:2601.00151v1 Announce Type: new Abstract: We study reinforcement learning methods with linear function approximation under non-Markov state and cost processes. We first consider the policy evaluation method and show that the algorithm converges under suitable ergodicity conditions on the underlying…

January 5, 2026

Information-Theoretic Quality Metric of Low-Dimensional Embeddings

arXiv:2512.23981v2 Announce Type: replace Abstract: In this work we study the quality of low-dimensional embeddings from an explicitly information-theoretic perspective. We begin by noting that classical evaluation metrics such as stress, rank-based neighborhood criteria, or Local Procrustes quantify distortions in…

January 5, 2026

Dynamic Bayesian Optimization Framework for Instruction Tuning in Partial Differential Equation Discovery

arXiv:2601.00088v1 Announce Type: new Abstract: Large Language Models (LLMs) show promise for equation discovery, yet their outputs are highly sensitive to prompt phrasing, a phenomenon we term instruction brittleness. Static prompts cannot adapt to the evolving state of a multi-step…

January 5, 2026

GRL-SNAM: Geometric Reinforcement Learning with Path Differential Hamiltonians for Simultaneous Navigation and Mapping in Unknown Environments

arXiv:2601.00116v1 Announce Type: new Abstract: We present GRL-SNAM, a geometric reinforcement learning framework for Simultaneous Navigation and Mapping (SNAM) in unknown environments. A SNAM problem is challenging as it needs to design hierarchical or joint policies of multiple agents that control…

January 5, 2026

Exploration in the Limit

arXiv:2601.00084v1 Announce Type: new Abstract: In fixed-confidence best arm identification (BAI), the objective is to quickly identify the optimal option while controlling the probability of error below a desired threshold. Despite the plethora of BAI algorithms, existing methods typically fall…

January 5, 2026

IMBWatch — a Spatio-Temporal Graph Neural Network approach to detect Illicit Massage Business

arXiv:2601.00075v1 Announce Type: new Abstract: Illicit Massage Businesses (IMBs) are a covert and persistent form of organized exploitation that operate under the facade of legitimate wellness services while facilitating human trafficking, sexual exploitation, and coerced labor. Detecting IMBs is difficult…

January 5, 2026

The Trojan in the Vocabulary: Stealthy Sabotage of LLM Composition

arXiv:2601.00065v1 Announce Type: new Abstract: The open-weight LLM ecosystem is increasingly defined by model composition techniques (such as weight merging, speculative decoding, and vocabulary expansion) that remix capabilities from diverse sources. A critical prerequisite for applying these methods across different…

January 5, 2026

Homogenization with Guaranteed Bounds via Primal-Dual Physically Informed Neural Networks

arXiv:2509.07579v2 Announce Type: replace Abstract: Physics-informed neural networks (PINNs) have shown promise in solving partial differential equations (PDEs) relevant to multiscale modeling, but they often fail when applied to materials with discontinuous coefficients, such as media with piecewise constant properties.…

January 5, 2026

New research may help scientists predict when a humid heat wave will break

As these events become more common at midlatitudes, a phenomenon called an atmospheric inversion will determine how long they last.

January 5, 2026

The Curse of Depth in Large Language Models

arXiv:2502.05795v3 Announce Type: replace Abstract: In this paper, we introduce the Curse of Depth, a concept that highlights, explains, and addresses the recent observation in modern Large Language Models (LLMs) where nearly half of the layers are less effective than…

January 5, 2026