Archives AI News

VoiceAgentBench: Are Voice Assistants ready for agentic tasks?

arXiv:2510.07978v2 Announce Type: replace-cross Abstract: Large-scale Speech Language Models (SpeechLMs) have enabled voice assistants capable of understanding natural spoken queries and performing complex tasks. However, existing speech benchmarks primarily focus on isolated capabilities such as transcription, or question-answering, and do…

The Curved Spacetime of Transformer Architectures

arXiv:2511.03060v1 Announce Type: new Abstract: We present a geometric framework for understanding Transformer-based language models, drawing an explicit analogy to General Relativity. Queries and keys induce an effective metric on representation space, and attention acts as a discrete connection that…

Efficient Testing Implies Structured Symmetry

arXiv:2511.03653v1 Announce Type: cross Abstract: Given a small random sample of $n$-bit strings labeled by an unknown Boolean function, which properties of this function can be tested computationally efficiently? We show an equivalence between properties that are efficiently testable from…

REINFORCE-ING Chemical Language Models for Drug Discovery

arXiv:2501.15971v2 Announce Type: replace Abstract: Chemical language models, combined with reinforcement learning (RL), have shown significant promise to efficiently traverse large chemical spaces for drug discovery. However, the performance of various RL algorithms and their best practices for practical drug…

Sparse, self-organizing ensembles of local kernels detect rare statistical anomalies

arXiv:2511.03095v1 Announce Type: new Abstract: Modern artificial intelligence has revolutionized our ability to extract rich and versatile data representations across scientific disciplines. Yet, the statistical properties of these representations remain poorly controlled, causing misspecified anomaly detection (AD) methods to falter.…

NeuralSurv: Deep Survival Analysis with Bayesian Uncertainty Quantification

arXiv:2505.11054v2 Announce Type: replace Abstract: We introduce NeuralSurv, the first deep survival model to incorporate Bayesian uncertainty quantification. Our non-parametric, architecture-agnostic framework captures time-varying covariate-risk relationships in continuous time via a novel two-stage data-augmentation scheme, for which we establish theoretical…

Q&A: How folk ballads explain the world

Ruth Perry’s new book profiles Anna Gordon, a Scotswoman who preserved and transmitted precious popular ballads, and with them national traditions.