Archives AI News

The Trojan in the Vocabulary: Stealthy Sabotage of LLM Composition

arXiv:2601.00065v1 Announce Type: new Abstract: The open-weight LLM ecosystem is increasingly defined by model composition techniques (such as weight merging, speculative decoding, and vocabulary expansion) that remix capabilities from diverse sources. A critical prerequisite for applying these methods across different…

January 5, 2026

Homogenization with Guaranteed Bounds via Primal-Dual Physically Informed Neural Networks

arXiv:2509.07579v2 Announce Type: replace Abstract: Physics-informed neural networks (PINNs) have shown promise in solving partial differential equations (PDEs) relevant to multiscale modeling, but they often fail when applied to materials with discontinuous coefficients, such as media with piecewise constant properties.…

January 5, 2026

The Weather Paradox: Why Precipitation Fails to Predict Traffic Accident Severity in Large-Scale US Data

arXiv:2601.00152v1 Announce Type: new Abstract: This study investigates the predictive capacity of environmental, temporal, and spatial factors on traffic accident severity in the United States. Using a dataset of 500,000 U.S. traffic accidents spanning 2016-2023, we trained an XGBoost classifier…

January 5, 2026

Scaling Patterns in Adversarial Alignment: Evidence from Multi-LLM Jailbreak Experiments

arXiv:2511.13788v2 Announce Type: replace Abstract: Large language models (LLMs) increasingly operate in multi-agent and safety-critical settings, raising open questions about how their vulnerabilities scale when models interact adversarially. This study examines whether larger models can systematically jailbreak smaller ones –…

January 5, 2026

Online Finetuning Decision Transformers with Pure RL Gradients

arXiv:2601.00167v1 Announce Type: new Abstract: Decision Transformers (DTs) have emerged as a powerful framework for sequential decision making by formulating offline reinforcement learning (RL) as a sequence modeling problem. However, extending DTs to online settings with pure RL gradients remains…

January 5, 2026

CIC: Circular Image Compression

arXiv:2407.15870v4 Announce Type: replace-cross Abstract: Learned image compression (LIC) is currently the cutting-edge method. However, the inherent difference between testing and training images of LIC results in performance degradation to some extent. Especially for out-of-sample, out-of-distribution, or out-of-domain testing images,…

January 5, 2026

Sequential Reservoir Computing for Efficient High-Dimensional Spatiotemporal Forecasting

arXiv:2601.00172v1 Announce Type: new Abstract: Forecasting high-dimensional spatiotemporal systems remains computationally challenging for recurrent neural networks (RNNs) and long short-term memory (LSTM) models due to gradient-based training and memory bottlenecks. Reservoir Computing (RC) mitigates these challenges by replacing backpropagation with…

January 5, 2026

An Analytical and AI-discovered Stable, Accurate, and Generalizable Subgrid-scale Closure for Geophysical Turbulence

arXiv:2509.20365v3 Announce Type: replace-cross Abstract: By combining AI and fluid physics, we discover a closed-form closure for 2D turbulence from small direct numerical simulation (DNS) data. Large-eddy simulation (LES) with this closure is accurate and stable, reproducing DNS statistics including…

January 5, 2026

Early Prediction of Liver Cirrhosis Up to Three Years in Advance: A Machine Learning Study Benchmarking Against the FIB-4 Score

arXiv:2601.00175v1 Announce Type: new Abstract: Objective: Develop and evaluate machine learning (ML) models for predicting incident liver cirrhosis one, two, and three years prior to diagnosis using routinely collected electronic health record (EHR) data, and to benchmark their performance against…

January 5, 2026

RMAAT: Astrocyte-Inspired Memory Compression and Replay for Efficient Long-Context Transformers

arXiv:2601.00426v1 Announce Type: cross Abstract: The quadratic complexity of self-attention mechanism presents a significant impediment to applying Transformer models to long sequences. This work explores computational principles derived from astrocytes-glial cells critical for biological memory and synaptic modulation-as a complementary…

January 5, 2026