Archives AI News

Enabling Global, Human-Centered Explanations for LLMs: From Tokens to Interpretable Code and Test Generation

arXiv:2503.16771v3 Announce Type: replace-cross Abstract: As Large Language Models for Code (LM4Code) become integral to software engineering, establishing trust in their output becomes critical. However, standard accuracy metrics obscure the underlying reasoning of generative models, offering little insight into how…

April 14, 2026

Efficient Personalization of Generative User Interfaces

arXiv:2604.09876v1 Announce Type: new Abstract: Generative user interfaces (UIs) create new opportunities to adapt interfaces to individual users on demand, but personalization remains difficult because desirable UI properties are subjective, hard to articulate, and costly to infer from sparse feedback.…

April 14, 2026

Why Do Multilingual Reasoning Gaps Emerge in Reasoning Language Models?

arXiv:2510.27269v3 Announce Type: replace-cross Abstract: Reasoning language models (RLMs) achieve strong performance on complex reasoning tasks, yet they still exhibit a multilingual reasoning gap, performing better in high-resource languages than in low-resource ones. While recent efforts have been made to…

April 14, 2026

SemEnrich: Self-Supervised Semantic Enrichment of Radiology Reports for Vision-Language Learning

arXiv:2604.09887v1 Announce Type: new Abstract: Medical vision-language datasets are often limited in size and biased toward negative findings, as clinicians report abnormalities mostly but might omit some positive/neutral findings because they might be considered as irrelevant to the patient’s condition.…

April 14, 2026

FlexMS is a flexible framework for benchmarking deep learning-based mass spectrum prediction tools in metabolomics

arXiv:2602.22822v2 Announce Type: replace-cross Abstract: The identification and property prediction of chemical molecules is of central importance in the advancement of drug discovery and material science, where the tandem mass spectrometry technology gives valuable fragmentation cues in the form of…

April 14, 2026

Improving Pediatric Emergency Department Triage with Modality Dropout in Late Fusion Multimodal EHR Models

arXiv:2604.09905v1 Announce Type: new Abstract: Emergency department triage relies heavily on both quantitative vital signs and qualitative clinical notes, yet multimodal machine learning models predicting triage acuity often suffer from modality collapse by over-relying on structured tabular data. This limitation…

April 14, 2026

Select Smarter, Not More: Prompt-Aware Evaluation Scheduling with Submodular Guarantees

arXiv:2604.11328v1 Announce Type: cross Abstract: Automatic prompt optimization (APO) hinges on the quality of its evaluation signal, yet scoring every prompt candidate on the full training set is prohibitively expensive. Existing methods either fix a single evaluation subset before optimization…

April 14, 2026

Last-Iterate Convergence of Randomized Kaczmarz and SGD with Greedy Step Size

arXiv:2604.09909v1 Announce Type: new Abstract: We study last-iterate convergence of SGD with greedy step size over smooth quadratics in the interpolation regime, a setting which captures the classical Randomized Kaczmarz algorithm as well as other popular iterative linear system solvers.…

April 14, 2026

Utilizing and Calibrating Hindsight Process Rewards via Reinforcement with Mutual Information Self-Evaluation

arXiv:2604.11611v1 Announce Type: cross Abstract: To overcome the sparse reward challenge in reinforcement learning (RL) for agents based on large language models (LLMs), we propose Mutual Information Self-Evaluation (MISE), an RL paradigm that utilizes hindsight generative self-evaluation as dense reward…

April 14, 2026

Regularized Entropy Information Adaptation with Temporal-Awareness Networks for Simultaneous Speech Translation

arXiv:2604.09916v1 Announce Type: new Abstract: Simultaneous Speech Translation (SimulST) requires balancing high translation quality with low latency. Recent work introduced REINA, a method that trains a Read/Write policy based on estimating the information gain of reading more audio. However, we…

April 14, 2026