Archives AI News

EventADL: Open-Box Anomaly Detection and Localization Framework for Events in Cloud-Based Service Systems

arXiv:2605.00936v1 Announce Type: new Abstract: Anomaly detection and localization (ADL) is critical for maintaining reliability and availability in cloud systems. Recent ADL developments focus on metric and log data, leaving event data unexplored. To address this gap, we propose EventADL,…

May 5, 2026

The Geometric Inductive Bias of Grokking: Bypassing Phase Transitions via Architectural Topology

arXiv:2603.05228v3 Announce Type: replace Abstract: Mechanistic interpretability typically relies on post-hoc analysis of trained networks. We instead adopt an interventional approach: testing hypotheses a priori by modifying architectural topology to observe training dynamics. We study grokking – delayed generalization in…

May 5, 2026

Fusing Urban Structure and Semantics: A Conditional Diffusion Model for Cross-City OD Matrix Generation

arXiv:2605.00938v1 Announce Type: new Abstract: Accurate modeling of commuting flows is important for urban governance, traffic planning, and resource allocation. However, the combined influence of individual intentions, geographic constraints, and social dynamics leads to considerable heterogeneity in commuting patterns, making…

May 5, 2026

The tech revolution that wasn’t

Dwai Banerjee’s new book examines the visionaries who wanted to turn India into a world power at making computers.

May 5, 2026

Astronomers pin down the origins of a planetary odd couple

New measurements of a hot Jupiter and its mini-Neptune companion suggest both planets formed surprisingly far away from their host star.

May 5, 2026

TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models

arXiv:2504.20605v2 Announce Type: replace-cross Abstract: Moral stories are a time-tested vehicle for transmitting values, yet modern NLP lacks a large, structured corpus that couples coherent narratives with explicit ethical lessons. We present TF1-EN-3M, to our knowledge the first open dataset…

May 5, 2026

Split-on-Share: Mixture of Sparse Experts for Task-Agnostic Continual Learning

arXiv:2601.17616v2 Announce Type: replace Abstract: Continual learning in Large Language Models (LLMs) is hindered by the plasticity-stability dilemma, where acquiring new capabilities often leads to catastrophic forgetting of previous knowledge. Existing methods typically treat parameters uniformly, failing to distinguish between…

May 5, 2026

Schema-Adaptive Tabular Representation Learning with LLMs for Generalizable Multimodal Clinical Reasoning

arXiv:2604.11835v2 Announce Type: replace Abstract: Machine learning for tabular data remains constrained by poor schema generalization, a challenge rooted in the lack of semantic understanding of structured variables. This challenge is particularly acute in domains like clinical medicine, where electronic…

May 5, 2026

U-Define: Designing User Workflows for Hard and Soft Constraints in LLM-Based Planning

arXiv:2605.02765v1 Announce Type: cross Abstract: LLMs are increasingly used for end-user task planning, yet their black-box nature limits users’ ability to ensure reliability and control. While recent systems incorporate verification techniques, it remains unclear how users can effectively apply such…

May 5, 2026

Time-series forecasting through the lens of dynamics

arXiv:2507.15774v2 Announce Type: replace Abstract: While deep learning is facing a homogenization across modalities led by Transformers, they are still challenged by shallow linear models in the time-series forecasting task. Our hypothesis is that models should learn a direct link…

May 5, 2026