Adversarial Latent-State Training for Robust Policies in Partially Observable Domains

arXiv:2603.07313v2 Announce Type: replace Abstract: Robustness under latent distribution shift remains challenging in partially observable reinforcement learning. We formalize a focused setting where an adversary selects a hidden initial latent distribution before the episode, termed an adversarial latent-initial-state POMDP. Theoretically,…

March 11, 2026

MAcPNN: Mutual Assisted Learning on Data Streams with Temporal Dependence

arXiv:2603.08972v1 Announce Type: new Abstract: Internet of Things (IoT) Analytics often involves applying machine learning (ML) models on data streams. In such scenarios, traditional ML paradigms face obstacles related to continuous learning while dealing with concept drifts, temporal dependence, and…

March 11, 2026

On the Impact of the Utility in Semivalue-based Data Valuation

arXiv:2502.06574v4 Announce Type: replace-cross Abstract: Semivalue-based data valuation uses cooperative-game theory intuitions to assign each data point a value reflecting its contribution to a downstream task. Still, those values depend on the practitioner’s choice of utility, raising the question: How…

March 11, 2026

MAPLE: Elevating Medical Reasoning from Statistical Consensus to Process-Led Alignment

arXiv:2603.08987v1 Announce Type: new Abstract: Recent advances in medical large language models have explored Test-Time Reinforcement Learning (TTRL) to enhance reasoning. However, standard TTRL often relies on majority voting (MV) as a heuristic supervision signal, which can be unreliable in…

March 11, 2026

AlphaApollo: A System for Deep Agentic Reasoning

arXiv:2510.06261v2 Announce Type: replace-cross Abstract: We present AlphaApollo, an agentic reasoning system that targets two bottlenecks in foundation-model reasoning: (1) limited reasoning capacity for complex, long-horizon problem solving and (2) unreliable test-time evolution without trustworthy verification. AlphaApollo orchestrates models and…

March 11, 2026

The Coupling Within: Flow Matching via Distilled Normalizing Flows

arXiv:2603.09014v1 Announce Type: new Abstract: Flow models have rapidly become the go-to method for training and deploying large-scale generators, owing their success to inference-time flexibility via adjustable integration steps. A crucial ingredient in flow training is the choice of coupling…

March 11, 2026

Missing-by-Design: Certifiable Modality Deletion for Revocable Multimodal Sentiment Analysis

arXiv:2602.16144v2 Announce Type: replace-cross Abstract: As multimodal systems increasingly process sensitive personal data, the ability to selectively revoke specific data modalities has become a critical requirement for privacy compliance and user autonomy. We present Missing-by-Design (MBD), a unified framework for…

March 11, 2026

An accurate flatness measure to estimate the generalization performance of CNN models

arXiv:2603.09016v1 Announce Type: new Abstract: Flatness measures based on the spectrum or the trace of the Hessian of the loss are widely used as proxies for the generalization ability of deep networks. However, most existing definitions are either tailored to…

March 11, 2026

TrainDeeploy: Hardware-Accelerated Parameter-Efficient Fine-Tuning of Small Transformer Models at the Extreme Edge

arXiv:2603.09511v1 Announce Type: cross Abstract: On-device tuning of deep neural networks enables long-term adaptation at the edge while preserving data privacy. However, the high computational and memory demands of backpropagation pose significant challenges for ultra-low-power, memory-constrained extreme-edge devices. These challenges…

March 11, 2026

When to Retrain after Drift: A Data-Only Test of Post-Drift Data Size Sufficiency

arXiv:2603.09024v1 Announce Type: new Abstract: Sudden concept drift makes previously trained predictors unreliable, yet deciding when to retrain and what post-drift data size is sufficient is rarely addressed. We propose CALIPER – a detector- and model-agnostic, data-only test that estimates…

March 11, 2026