Archives AI News

Audio2Tool: Speak, Call, Act — A Dataset for Benchmarking Speech Tool Use

arXiv:2604.22821v2 Announce Type: replace-cross Abstract: Voice assistants increasingly rely on Speech Language Models (SpeechLMs) to interpret spoken queries and execute complex tasks, yet existing benchmarks lack the domain breadth, acoustic diversity, and compositional reasoning complexity needed to evaluate tool-calling performance. We introduce…

April 29, 2026

Transformer Approximations from ReLUs

arXiv:2604.24878v1 Announce Type: new Abstract: We provide a systematic recipe for translating ReLU approximation results to the softmax attention mechanism. This recipe covers many common approximation targets. Importantly, it yields target-specific, economic resource bounds beyond universal approximation statements. We showcase the…

April 29, 2026

Residual-loss Anomaly Analysis of Physics-Informed Neural Networks: An Inverse Method for Change-point Detection in Nonlinear Dynamical Systems with Regime Switching

arXiv:2604.25655v1 Announce Type: cross Abstract: Nonlinear dynamical systems with regime transitions are typically described by ordinary differential equations with jumping parameters. Traditional methods often treat change-point detection and parameter estimation as separate tasks, ignoring the inherent coupling between them.…

April 29, 2026

Contrastive Image-Metadata Pre-Training for Materials Transmission Electron Microscopy

arXiv:2604.24909v1 Announce Type: new Abstract: The vast majority of transmission electron microscopy (TEM) data never gets published and ends up on a backup drive until deleted to free up space. These left-over datasets are rich in detail and variation, often…

April 29, 2026

Variational Neural Belief Parameterizations for Robust Dexterous Grasping under Multimodal Uncertainty

arXiv:2604.25897v1 Announce Type: cross Abstract: Contact variability, sensing uncertainty, and external disturbances make grasp execution stochastic. Expected-quality objectives ignore tail outcomes and often select grasps that fail under adverse contact realizations. Risk-sensitive POMDPs address this failure mode, but many use…

April 29, 2026

Learning with Embedded Linear Equality Constraints via Variational Bayesian Inference

arXiv:2604.24911v1 Announce Type: new Abstract: Machine learning is becoming more prevalent in science and engineering, but many approaches do not provide meaningful uncertainty estimates, and their predictions may violate known physical knowledge. We propose a Bayesian framework to embed linear…

April 29, 2026

ARQ: A Mixed-Precision Quantization Framework for Accurate and Certifiably Robust DNNs

arXiv:2410.24214v3 Announce Type: replace Abstract: Mixed-precision quantization has become an important technique for optimizing the execution of deep neural networks (DNNs). Certified robustness, which provides provable guarantees about a model’s ability to withstand different adversarial perturbations, has rarely been…

April 29, 2026

Generative diffusion models for spatiotemporal influenza forecasting

arXiv:2604.24913v1 Announce Type: new Abstract: Forecasting infectious disease incidence can provide important information to guide public health planning, yet is difficult because epidemic dynamics are complex. Current mechanistic and statistical approaches often struggle to capture multimodal uncertainty or emergent trends.…

April 29, 2026

Revisiting the Past: Data Unlearning with Model State History

arXiv:2506.20941v3 Announce Type: replace Abstract: Large language models are trained on massive corpora of web data, which may include private data, copyrighted material, factually inaccurate data, or data that degrades model performance. Eliminating the influence of such problematic datapoints on…

April 29, 2026

A Unifying Framework for Unsupervised Concept Extraction

arXiv:2604.24936v1 Announce Type: new Abstract: Techniques for concept extraction, such as sparse autoencoders and transcoders, aim to extract high-level symbolic concepts from low-level nonsymbolic representations. When these extracted concepts are used for downstream tasks such as model steering and unlearning,…

April 29, 2026