Archives AI News

Dynamic Bayesian Optimization Framework for Instruction Tuning in Partial Differential Equation Discovery

arXiv:2601.00088v1 Announce Type: new Abstract: Large Language Models (LLMs) show promise for equation discovery, yet their outputs are highly sensitive to prompt phrasing, a phenomenon we term instruction brittleness. Static prompts cannot adapt to the evolving state of a multi-step…

January 5, 2026

GRL-SNAM: Geometric Reinforcement Learning with Path Differential Hamiltonians for Simultaneous Navigation and Mapping in Unknown Environments

arXiv:2601.00116v1 Announce Type: new Abstract: We present GRL-SNAM, a geometric reinforcement learning framework for Simultaneous Navigation and Mapping(SNAM) in unknown environments. A SNAM problem is challenging as it needs to design hierarchical or joint policies of multiple agents that control…

January 5, 2026

Exploration in the Limit

arXiv:2601.00084v1 Announce Type: new Abstract: In fixed-confidence best arm identification (BAI), the objective is to quickly identify the optimal option while controlling the probability of error below a desired threshold. Despite the plethora of BAI algorithms, existing methods typically fall…

January 5, 2026

IMBWatch — a Spatio-Temporal Graph Neural Network approach to detect Illicit Massage Business

arXiv:2601.00075v1 Announce Type: new Abstract: Illicit Massage Businesses (IMBs) are a covert and persistent form of organized exploitation that operate under the facade of legitimate wellness services while facilitating human trafficking, sexual exploitation, and coerced labor. Detecting IMBs is difficult…

January 5, 2026

The Trojan in the Vocabulary: Stealthy Sabotage of LLM Composition

arXiv:2601.00065v1 Announce Type: new Abstract: The open-weight LLM ecosystem is increasingly defined by model composition techniques (such as weight merging, speculative decoding, and vocabulary expansion) that remix capabilities from diverse sources. A critical prerequisite for applying these methods across different…

January 5, 2026

Homogenization with Guaranteed Bounds via Primal-Dual Physically Informed Neural Networks

arXiv:2509.07579v2 Announce Type: replace Abstract: Physics-informed neural networks (PINNs) have shown promise in solving partial differential equations (PDEs) relevant to multiscale modeling, but they often fail when applied to materials with discontinuous coefficients, such as media with piecewise constant properties.…

January 5, 2026

The Weather Paradox: Why Precipitation Fails to Predict Traffic Accident Severity in Large-Scale US Data

arXiv:2601.00152v1 Announce Type: new Abstract: This study investigates the predictive capacity of environmental, temporal, and spatial factors on traffic accident severity in the United States. Using a dataset of 500,000 U.S. traffic accidents spanning 2016-2023, we trained an XGBoost classifier…

January 5, 2026

Scaling Patterns in Adversarial Alignment: Evidence from Multi-LLM Jailbreak Experiments

arXiv:2511.13788v2 Announce Type: replace Abstract: Large language models (LLMs) increasingly operate in multi-agent and safety-critical settings, raising open questions about how their vulnerabilities scale when models interact adversarially. This study examines whether larger models can systematically jailbreak smaller ones –…

January 5, 2026

Online Finetuning Decision Transformers with Pure RL Gradients

arXiv:2601.00167v1 Announce Type: new Abstract: Decision Transformers (DTs) have emerged as a powerful framework for sequential decision making by formulating offline reinforcement learning (RL) as a sequence modeling problem. However, extending DTs to online settings with pure RL gradients remains…

January 5, 2026

CIC: Circular Image Compression

arXiv:2407.15870v4 Announce Type: replace-cross Abstract: Learned image compression (LIC) is currently the cutting-edge method. However, the inherent difference between testing and training images of LIC results in performance degradation to some extent. Especially for out-of-sample, out-of-distribution, or out-of-domain testing images,…

January 5, 2026