Archives AI News

Contextualize-then-Aggregate: Circuits for In-Context Learning in Gemma-2 2B

arXiv:2504.00132v4 Announce Type: replace-cross Abstract: In-Context Learning (ICL) is an intriguing ability of large language models (LLMs). Despite a substantial amount of work on its behavioral aspects and how it emerges in miniature setups, it remains unclear which mechanism assembles…

September 18, 2025

Mining the Long Tail: A Comparative Study of Data-Centric Criticality Metrics for Robust Offline Reinforcement Learning in Autonomous Motion Planning

arXiv:2508.18397v2 Announce Type: replace-cross Abstract: Offline Reinforcement Learning (RL) presents a promising paradigm for training autonomous vehicle (AV) planning policies from large-scale, real-world driving logs. However, the extreme data imbalance in these logs, where mundane scenarios vastly outnumber rare “long-tail”…

September 18, 2025

Enabling Local Neural Operators to perform Equation-Free System-Level Analysis

arXiv:2505.02308v2 Announce Type: replace Abstract: Neural Operators (NOs) provide a powerful framework for computations involving physical laws that can be modelled by (integro-) partial differential equations (PDEs), directly learning maps between infinite-dimensional function spaces that bypass both the explicit equation…

September 18, 2025

HAM: Hierarchical Adapter Merging for Scalable Continual Learning

arXiv:2509.13211v2 Announce Type: replace Abstract: Continual learning is an essential capability of human cognition, yet it poses significant challenges for current deep learning models. The primary issue is that new knowledge can interfere with previously learned information, causing the model…

September 18, 2025

Privacy-Aware In-Context Learning for Large Language Models

arXiv:2509.13625v1 Announce Type: new Abstract: Large language models (LLMs) have significantly transformed natural language understanding and generation, but they raise privacy concerns due to potential exposure of sensitive information. Studies have highlighted the risk of information leakage, where adversaries can…

September 18, 2025

DeepLogit: A sequentially constrained explainable deep learning modeling approach for transport policy analysis

arXiv:2509.13633v1 Announce Type: new Abstract: Despite the significant progress of deep learning models in multitude of applications, their adaption in planning and policy related areas remains challenging due to the black-box nature of these models. In this work, we develop…

September 18, 2025

Is GPT-4o mini Blinded by its Own Safety Filters? Exposing the Multimodal-to-Unimodal Bottleneck in Hate Speech Detection

arXiv:2509.13608v1 Announce Type: new Abstract: As Large Multimodal Models (LMMs) become integral to daily digital life, understanding their safety architectures is a critical problem for AI Alignment. This paper presents a systematic analysis of OpenAI’s GPT-4o mini, a globally deployed…

September 18, 2025

Unsupervised Anomaly Detection in ALS EPICS Event Logs

arXiv:2509.13621v1 Announce Type: new Abstract: This paper introduces an automated fault analysis framework for the Advanced Light Source (ALS) that processes real-time event logs from its EPICS control system. By treating log entries as natural language, we transform them into…

September 18, 2025

Meta-Learning Linear Models for Molecular Property Prediction

arXiv:2509.13527v1 Announce Type: new Abstract: Chemists in search of structure-property relationships face great challenges due to limited high quality, concordant datasets. Machine learning (ML) has significantly advanced predictive capabilities in chemical sciences, but these modern data-driven approaches have increased the…

September 18, 2025

AERIS: Argonne Earth Systems Model for Reliable and Skillful Predictions

arXiv:2509.13523v1 Announce Type: new Abstract: Generative machine learning offers new opportunities to better understand complex Earth system dynamics. Recent diffusion-based methods address spectral biases and improve ensemble calibration in weather forecasting compared to deterministic methods, yet have so far proven…

September 18, 2025