Archives AI News

Knowing When to Quit: Probabilistic Early Exits for Speech Separation

arXiv:2507.09768v3 Announce Type: replace Abstract: In recent years, deep learning-based single-channel speech separation has improved considerably, in large part driven by increasingly compute- and parameter-efficient neural network architectures. Most such architectures are, however, designed with a fixed compute and parameter…

Circuit Insights: Towards Interpretability Beyond Activations

arXiv:2510.14936v2 Announce Type: replace Abstract: The fields of explainable AI and mechanistic interpretability aim to uncover the internal structure of neural networks, with circuit discovery as a central tool for understanding model computations. Existing approaches, however, rely on manual inspection…

Solving adversarial examples requires solving exponential misalignment

arXiv:2603.03507v1 Announce Type: new Abstract: Adversarial attacks – input perturbations imperceptible to humans that fool neural networks – remain both a persistent failure mode in machine learning, and a phenomenon with mysterious origins. To shed light, we define and analyze…

Agnostic Tomography of Stabilizer Product States

arXiv:2404.03813v5 Announce Type: replace-cross Abstract: We define a quantum learning task called agnostic tomography, where given copies of an arbitrary state $rho$ and a class of quantum states $mathcal{C}$, the goal is to output a succinct description of a state…

MMAI Gym for Science: Training Liquid Foundation Models for Drug Discovery

arXiv:2603.03517v1 Announce Type: new Abstract: General-purpose large language models (LLMs) that rely on in-context learning do not reliably deliver the scientific understanding and performance required for drug discovery tasks. Simply increasing model size or introducing reasoning tokens does not yield…

Context Biasing for Pronunciation-Orthography Mismatch in Automatic Speech Recognition

arXiv:2506.18703v3 Announce Type: replace-cross Abstract: Neural sequence-to-sequence systems deliver state-of-the-art performance for automatic speech recognition. When using appropriate modeling units, e.g., byte-pair encoding, these systems are in principle open vocabulary systems. In practice, however, they often fail to recognize words…