Archives AI News

MAPLE: Elevating Medical Reasoning from Statistical Consensus to Process-Led Alignment

arXiv:2603.08987v1 Announce Type: new Abstract: Recent advances in medical large language models have explored Test-Time Reinforcement Learning (TTRL) to enhance reasoning. However, standard TTRL often relies on majority voting (MV) as a heuristic supervision signal, which can be unreliable in…

March 11, 2026

AlphaApollo: A System for Deep Agentic Reasoning

arXiv:2510.06261v2 Announce Type: replace-cross Abstract: We present AlphaApollo, an agentic reasoning system that targets two bottlenecks in foundation-model reasoning: (1) limited reasoning capacity for complex, long-horizon problem solving and (2) unreliable test-time evolution without trustworthy verification. AlphaApollo orchestrates models and…

March 11, 2026

The Coupling Within: Flow Matching via Distilled Normalizing Flows

arXiv:2603.09014v1 Announce Type: new Abstract: Flow models have rapidly become the go-to method for training and deploying large-scale generators, owing their success to inference-time flexibility via adjustable integration steps. A crucial ingredient in flow training is the choice of coupling…

March 11, 2026

Missing-by-Design: Certifiable Modality Deletion for Revocable Multimodal Sentiment Analysis

arXiv:2602.16144v2 Announce Type: replace-cross Abstract: As multimodal systems increasingly process sensitive personal data, the ability to selectively revoke specific data modalities has become a critical requirement for privacy compliance and user autonomy. We present Missing-by-Design (MBD), a unified framework for…

March 11, 2026

An accurate flatness measure to estimate the generalization performance of CNN models

arXiv:2603.09016v1 Announce Type: new Abstract: Flatness measures based on the spectrum or the trace of the Hessian of the loss are widely used as proxies for the generalization ability of deep networks. However, most existing definitions are either tailored to…

March 11, 2026

TrainDeeploy: Hardware-Accelerated Parameter-Efficient Fine-Tuning of Small Transformer Models at the Extreme Edge

arXiv:2603.09511v1 Announce Type: cross Abstract: On-device tuning of deep neural networks enables long-term adaptation at the edge while preserving data privacy. However, the high computational and memory demands of backpropagation pose significant challenges for ultra-low-power, memory-constrained extreme-edge devices. These challenges…

March 11, 2026

When to Retrain after Drift: A Data-Only Test of Post-Drift Data Size Sufficiency

arXiv:2603.09024v1 Announce Type: new Abstract: Sudden concept drift makes previously trained predictors unreliable, yet deciding when to retrain and what post-drift data size is sufficient is rarely addressed. We propose CALIPER – a detector- and model-agnostic, data-only test that estimates…

March 11, 2026

Global universality via discrete-time signatures

arXiv:2603.09773v1 Announce Type: cross Abstract: We establish global universal approximation theorems on spaces of piecewise linear paths, stating that linear functionals of the corresponding signatures are dense with respect to $L^p$- and weighted norms, under an integrability condition on the…

March 11, 2026

Two Teachers Better Than One: Hardware-Physics Co-Guided Distributed Scientific Machine Learning

arXiv:2603.09032v1 Announce Type: new Abstract: Scientific machine learning (SciML) is increasingly applied to in-field processing, controlling, and monitoring; however, wide-area sensing, real-time demands, and strict energy and reliability constraints make centralized SciML implementation impractical. Most SciML models assume raw data…

March 11, 2026

Sparse Variational Student-t Processes for Heavy-tailed Modeling

arXiv:2408.06699v2 Announce Type: replace Abstract: The Gaussian process (GP) is a powerful tool for nonparametric modeling, but its sensitivity to outliers limits its applicability to data distributions with heavy-tails. Studentt processes offer a robust alternative for heavy tail modeling, but…

March 11, 2026