Archives AI News

VoiceAgentBench: Are Voice Assistants ready for agentic tasks?

arXiv:2510.07978v2 Announce Type: replace-cross Abstract: Large-scale Speech Language Models (SpeechLMs) have enabled voice assistants capable of understanding natural spoken queries and performing complex tasks. However, existing speech benchmarks primarily focus on isolated capabilities such as transcription, or question-answering, and do…

November 6, 2025

The Curved Spacetime of Transformer Architectures

arXiv:2511.03060v1 Announce Type: new Abstract: We present a geometric framework for understanding Transformer-based language models, drawing an explicit analogy to General Relativity. Queries and keys induce an effective metric on representation space, and attention acts as a discrete connection that…

November 6, 2025

Efficient Testing Implies Structured Symmetry

arXiv:2511.03653v1 Announce Type: cross Abstract: Given a small random sample of $n$-bit strings labeled by an unknown Boolean function, which properties of this function can be tested computationally efficiently? We show an equivalence between properties that are efficiently testable from…

November 6, 2025

Homomorphism distortion: A metric to distinguish them all and in the latent space bind them

arXiv:2511.03068v1 Announce Type: new Abstract: For far too long, expressivity of graph neural networks has been measured emph{only} in terms of combinatorial properties. In this work we stray away from this tradition and provide a principled way to measure similarity…

November 6, 2025

AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization

arXiv:2405.18187v2 Announce Type: replace Abstract: Implicit Q-learning (IQL) serves as a strong baseline for offline RL, which learns the value function using only dataset actions through quantile regression. However, it is unclear how to recover the implicit policy from the…

November 6, 2025

Online Learning to Rank under Corruption: A Robust Cascading Bandits Approach

arXiv:2511.03074v1 Announce Type: new Abstract: Online learning to rank (OLTR) studies how to recommend a short ranked list of items from a large pool and improves future rankings based on user clicks. This setting is commonly modeled as cascading bandits,…

November 6, 2025

REINFORCE-ING Chemical Language Models for Drug Discovery

arXiv:2501.15971v2 Announce Type: replace Abstract: Chemical language models, combined with reinforcement learning (RL), have shown significant promise to efficiently traverse large chemical spaces for drug discovery. However, the performance of various RL algorithms and their best practices for practical drug…

November 6, 2025

Sparse, self-organizing ensembles of local kernels detect rare statistical anomalies

arXiv:2511.03095v1 Announce Type: new Abstract: Modern artificial intelligence has revolutionized our ability to extract rich and versatile data representations across scientific disciplines. Yet, the statistical properties of these representations remain poorly controlled, causing misspecified anomaly detection (AD) methods to falter.…

November 6, 2025

NeuralSurv: Deep Survival Analysis with Bayesian Uncertainty Quantification

arXiv:2505.11054v2 Announce Type: replace Abstract: We introduce NeuralSurv, the first deep survival model to incorporate Bayesian uncertainty quantification. Our non-parametric, architecture-agnostic framework captures time-varying covariate-risk relationships in continuous time via a novel two-stage data-augmentation scheme, for which we establish theoretical…

November 6, 2025

Q&A: How folk ballads explain the world

Ruth Perry’s new book profiles Anna Gordon, a Scotswoman who preserved and transmitted precious popular ballads, and with them national traditions.

November 6, 2025