Archives AI News

Multimodal LLM-assisted Evolutionary Search for Programmatic Control Policies

arXiv:2508.05433v3 Announce Type: replace Abstract: Deep reinforcement learning has achieved impressive success in control tasks. However, its policies, represented as opaque neural networks, are often difficult for humans to understand, verify, and debug, which undermines trust and hinders real-world deployment.…

Learning Adaptive LLM Decoding

arXiv:2603.09065v1 Announce Type: new Abstract: Decoding from large language models (LLMs) typically relies on fixed sampling hyperparameters (e.g., temperature, top-p), despite substantial variation in task difficulty and uncertainty across prompts and individual decoding steps. We propose to learn adaptive decoding…

DUEL: Exact Likelihood for Masked Diffusion via Deterministic Unmasking

arXiv:2603.01367v2 Announce Type: replace Abstract: Masked diffusion models (MDMs) generate text by iteratively selecting positions to unmask and then predicting tokens at those positions. Yet MDMs lack proper likelihood evaluation: the evidence lower bound (ELBO) is not only a loose…

Rating Quality of Diverse Time Series Data by Meta-learning from LLM Judgment

arXiv:2506.01290v2 Announce Type: replace Abstract: High-quality time series (TS) data are essential for ensuring TS model performance, rendering research on rating TS data quality indispensable. Existing methods have shown promising rating accuracy within individual domains, primarily by extending data quality…

Structured Matrix Scaling for Multi-Class Calibration

arXiv:2511.03685v2 Announce Type: replace Abstract: Post-hoc recalibration methods are widely used to ensure that classifiers provide faithful probability estimates. We argue that parametric recalibration functions based on logistic regression can be motivated from a simple theoretical setting for both binary…

Unsupervised Representation Learning from Sparse Transformation Analysis

arXiv:2410.05564v3 Announce Type: replace Abstract: There is a vast literature on representation learning based on principles such as coding efficiency, statistical independence, causality, controllability, or symmetry. In this paper we propose to learn representations from sequence data by factorizing the…