Archives AI News

Variable Selection in Maximum Mean Discrepancy for Interpretable Distribution Comparison

arXiv:2311.01537v2 Announce Type: replace-cross Abstract: We study two-sample variable selection: identifying variables that discriminate between the distributions of two sets of data vectors. Such variables help scientists understand the mechanisms behind dataset discrepancies. Although domain-specific methods exist (e.g., in medical…

Data-Efficient Realized Volatility Forecasting with Vision Transformers

arXiv:2511.03046v1 Announce Type: new Abstract: Recent work in financial machine learning has shown the virtue of complexity: the phenomenon by which deep learning methods capable of learning highly nonlinear relationships outperform simpler approaches in financial forecasting. While transformer architectures like…

Unsupervised Evaluation of Multi-Turn Objective-Driven Interactions

arXiv:2511.03047v1 Announce Type: new Abstract: Large language models (LLMs) have seen increasing popularity in enterprise applications where AI agents and humans engage in objective-driven interactions. However, these systems are difficult to evaluate: data may be complex and unlabeled; human annotation…

VoiceAgentBench: Are Voice Assistants ready for agentic tasks?

arXiv:2510.07978v2 Announce Type: replace-cross Abstract: Large-scale Speech Language Models (SpeechLMs) have enabled voice assistants capable of understanding natural spoken queries and performing complex tasks. However, existing speech benchmarks primarily focus on isolated capabilities such as transcription, or question-answering, and do…

The Curved Spacetime of Transformer Architectures

arXiv:2511.03060v1 Announce Type: new Abstract: We present a geometric framework for understanding Transformer-based language models, drawing an explicit analogy to General Relativity. Queries and keys induce an effective metric on representation space, and attention acts as a discrete connection that…

Efficient Testing Implies Structured Symmetry

arXiv:2511.03653v1 Announce Type: cross Abstract: Given a small random sample of $n$-bit strings labeled by an unknown Boolean function, which properties of this function can be tested computationally efficiently? We show an equivalence between properties that are efficiently testable from…

REINFORCE-ING Chemical Language Models for Drug Discovery

arXiv:2501.15971v2 Announce Type: replace Abstract: Chemical language models, combined with reinforcement learning (RL), have shown significant promise to efficiently traverse large chemical spaces for drug discovery. However, the performance of various RL algorithms and their best practices for practical drug…