Archives AI News

Density-Based Algorithms for Corruption-Robust Contextual Search and Convex Optimization

arXiv:2206.07528v3 Announce Type: replace Abstract: We study the problem of contextual search, a generalization of binary search in higher dimensions, in the adversarial noise model. Let $d$ be the dimension of the problem, $T$ be the time horizon and $C$…

January 5, 2026

Support Vector Machine Kernels as Quantum Propagators

arXiv:2502.11153v3 Announce Type: replace-cross Abstract: Selecting optimal kernels for regression in physical systems remains a challenge, often relying on trial-and-error with standard functions. In this work, we establish a mathematical correspondence between support vector machine kernels and quantum propagators, demonstrating…

January 5, 2026

Benchmark Success, Clinical Failure: When Reinforcement Learning Optimizes for Benchmarks, Not Patients

arXiv:2512.23090v2 Announce Type: replace-cross Abstract: Recent Reinforcement Learning (RL) advances for Large Language Models (LLMs) have improved reasoning tasks, yet their resource-constrained application to medical imaging remains underexplored. We introduce ChexReason, a vision-language model trained via R1-style methodology (SFT followed…

January 5, 2026

Reinforcement Learning with Function Approximation for Non-Markov Processes

arXiv:2601.00151v1 Announce Type: new Abstract: We study reinforcement learning methods with linear function approximation under non-Markov state and cost processes. We first consider the policy evaluation method and show that the algorithm converges under suitable ergodicity conditions on the underlying…

January 5, 2026

Information-Theoretic Quality Metric of Low-Dimensional Embeddings

arXiv:2512.23981v2 Announce Type: replace Abstract: In this work we study the quality of low-dimensional embeddings from an explicitly information-theoretic perspective. We begin by noting that classical evaluation metrics such as stress, rank-based neighborhood criteria, or Local Procrustes quantify distortions in…

January 5, 2026

Dynamic Bayesian Optimization Framework for Instruction Tuning in Partial Differential Equation Discovery

arXiv:2601.00088v1 Announce Type: new Abstract: Large Language Models (LLMs) show promise for equation discovery, yet their outputs are highly sensitive to prompt phrasing, a phenomenon we term instruction brittleness. Static prompts cannot adapt to the evolving state of a multi-step…

January 5, 2026

GRL-SNAM: Geometric Reinforcement Learning with Path Differential Hamiltonians for Simultaneous Navigation and Mapping in Unknown Environments

arXiv:2601.00116v1 Announce Type: new Abstract: We present GRL-SNAM, a geometric reinforcement learning framework for Simultaneous Navigation and Mapping(SNAM) in unknown environments. A SNAM problem is challenging as it needs to design hierarchical or joint policies of multiple agents that control…

January 5, 2026

Exploration in the Limit

arXiv:2601.00084v1 Announce Type: new Abstract: In fixed-confidence best arm identification (BAI), the objective is to quickly identify the optimal option while controlling the probability of error below a desired threshold. Despite the plethora of BAI algorithms, existing methods typically fall…

January 5, 2026

IMBWatch — a Spatio-Temporal Graph Neural Network approach to detect Illicit Massage Business

arXiv:2601.00075v1 Announce Type: new Abstract: Illicit Massage Businesses (IMBs) are a covert and persistent form of organized exploitation that operate under the facade of legitimate wellness services while facilitating human trafficking, sexual exploitation, and coerced labor. Detecting IMBs is difficult…

January 5, 2026

The Trojan in the Vocabulary: Stealthy Sabotage of LLM Composition

arXiv:2601.00065v1 Announce Type: new Abstract: The open-weight LLM ecosystem is increasingly defined by model composition techniques (such as weight merging, speculative decoding, and vocabulary expansion) that remix capabilities from diverse sources. A critical prerequisite for applying these methods across different…

January 5, 2026