Archives AI News

Rater Equivalence: Evaluating Classifiers in Human Judgment Settings

arXiv:2106.01254v2 Announce Type: replace Abstract: In many decision settings, the definitive ground truth is either non-existent or inaccessible. We introduce a framework for evaluating classifiers based solely on human judgments. In such cases, it is helpful to compare automated classifiers…

November 7, 2025

From Static to Dynamic: Enhancing Offline-to-Online Reinforcement Learning via Energy-Guided Diffusion Stratification

arXiv:2511.03828v1 Announce Type: new Abstract: Transitioning from offline to online reinforcement learning (RL) poses critical challenges due to distributional shifts between the fixed behavior policy in the offline dataset and the evolving policy during online learning. Although this issue is…

November 7, 2025

How Memory in Optimization Algorithms Implicitly Modifies the Loss

arXiv:2502.02132v2 Announce Type: replace Abstract: In modern optimization methods used in deep learning, each update depends on the history of previous iterations, often referred to as memory, and this dependence decays fast as the iterates go further into the past.…

November 7, 2025

Higher-Order Causal Structure Learning with Additive Models

arXiv:2511.03831v1 Announce Type: new Abstract: Causal structure learning has long been the central task of inferring causal insights from data. Despite the abundance of real-world processes exhibiting higher-order mechanisms, however, an explicit treatment of interactions in causal discovery has received…

November 7, 2025

Explicit Density Approximation for Neural Implicit Samplers Using a Bernstein-Based Convex Divergence

arXiv:2506.04700v2 Announce Type: replace Abstract: Rank-based statistical metrics, such as the invariant statistical loss (ISL), have recently emerged as robust and practically effective tools for training implicit generative models. In this work, we introduce dual-ISL, a novel likelihood-free objective for…

November 7, 2025

Enhancing Q-Value Updates in Deep Q-Learning via Successor-State Prediction

arXiv:2511.03836v1 Announce Type: new Abstract: Deep Q-Networks (DQNs) estimate future returns by learning from transitions sampled from a replay buffer. However, the target updates in DQN often rely on next states generated by actions from past, potentially suboptimal, policy. As…

November 7, 2025

Rewarding the Journey, Not Just the Destination: A Composite Path and Answer Self-Scoring Reward Mechanism for Test-Time Reinforcement Learning

arXiv:2510.17923v2 Announce Type: replace Abstract: Reinforcement Learning (RL) has emerged as a powerful paradigm for advancing Large Language Models (LLMs), achieving remarkable performance in complex reasoning domains such as mathematics and code generation. However, current RL methods face a fundamental…

November 7, 2025

Benchmark Datasets for Lead-Lag Forecasting on Social Platforms

arXiv:2511.03877v1 Announce Type: new Abstract: Social and collaborative platforms emit multivariate time-series traces in which early interactions-such as views, likes, or downloads-are followed, sometimes months or years later, by higher impact like citations, sales, or reviews. We formalize this setting…

November 7, 2025

Measure-Theoretic Time-Delay Embedding

arXiv:2409.08768v2 Announce Type: replace-cross Abstract: The celebrated Takens’ embedding theorem provides a theoretical foundation for reconstructing the full state of a dynamical system from partial observations. However, the classical theorem assumes that the underlying system is deterministic and that observations…

November 7, 2025

DecoHD: Decomposed Hyperdimensional Classification under Extreme Memory Budgets

arXiv:2511.03911v1 Announce Type: new Abstract: Decomposition is a proven way to shrink deep networks without changing I We bring this idea to hyperdimensional computing (HDC), where footprint cuts usually shrink the feature axis and erode concentration and robustness. Prior HDC…

November 7, 2025