Archives AI News

Contextualize-then-Aggregate: Circuits for In-Context Learning in Gemma-2 2B

arXiv:2504.00132v4 Announce Type: replace-cross Abstract: In-Context Learning (ICL) is an intriguing ability of large language models (LLMs). Despite a substantial amount of work on its behavioral aspects and how it emerges in miniature setups, it remains unclear which mechanism assembles…

Mining the Long Tail: A Comparative Study of Data-Centric Criticality Metrics for Robust Offline Reinforcement Learning in Autonomous Motion Planning

arXiv:2508.18397v2 Announce Type: replace-cross Abstract: Offline Reinforcement Learning (RL) presents a promising paradigm for training autonomous vehicle (AV) planning policies from large-scale, real-world driving logs. However, the extreme data imbalance in these logs, where mundane scenarios vastly outnumber rare “long-tail”…

Enabling Local Neural Operators to perform Equation-Free System-Level Analysis

arXiv:2505.02308v2 Announce Type: replace Abstract: Neural Operators (NOs) provide a powerful framework for computations involving physical laws that can be modelled by (integro-) partial differential equations (PDEs), directly learning maps between infinite-dimensional function spaces that bypass both the explicit equation…

HAM: Hierarchical Adapter Merging for Scalable Continual Learning

arXiv:2509.13211v2 Announce Type: replace Abstract: Continual learning is an essential capability of human cognition, yet it poses significant challenges for current deep learning models. The primary issue is that new knowledge can interfere with previously learned information, causing the model…

Privacy-Aware In-Context Learning for Large Language Models

arXiv:2509.13625v1 Announce Type: new Abstract: Large language models (LLMs) have significantly transformed natural language understanding and generation, but they raise privacy concerns due to potential exposure of sensitive information. Studies have highlighted the risk of information leakage, where adversaries can…

Unsupervised Anomaly Detection in ALS EPICS Event Logs

arXiv:2509.13621v1 Announce Type: new Abstract: This paper introduces an automated fault analysis framework for the Advanced Light Source (ALS) that processes real-time event logs from its EPICS control system. By treating log entries as natural language, we transform them into…

Meta-Learning Linear Models for Molecular Property Prediction

arXiv:2509.13527v1 Announce Type: new Abstract: Chemists in search of structure-property relationships face great challenges due to limited high quality, concordant datasets. Machine learning (ML) has significantly advanced predictive capabilities in chemical sciences, but these modern data-driven approaches have increased the…

AERIS: Argonne Earth Systems Model for Reliable and Skillful Predictions

arXiv:2509.13523v1 Announce Type: new Abstract: Generative machine learning offers new opportunities to better understand complex Earth system dynamics. Recent diffusion-based methods address spectral biases and improve ensemble calibration in weather forecasting compared to deterministic methods, yet have so far proven…