Archives AI News

When Shallow Wins: Silent Failures and the Depth-Accuracy Paradox in Latent Reasoning

arXiv:2603.03475v1 Announce Type: new Abstract: Mathematical reasoning models are widely deployed in education, automated tutoring, and decision support systems despite exhibiting fundamental computational instabilities. We demonstrate that state-of-the-art models (Qwen2.5-Math-7B) achieve 61% accuracy through a mixture of reliable and unreliable…

March 5, 2026

EnECG: Efficient Ensemble Learning for Electrocardiogram Multi-task Foundation Model

arXiv:2511.22935v2 Announce Type: replace Abstract: Electrocardiogram (ECG) analysis plays a vital role in the early detection, monitoring, and management of various cardiovascular conditions. While existing models have achieved notable success in ECG interpretation, they fail to leverage the interrelated nature…

March 5, 2026

Dynamic Adversarial Reinforcement Learning for Robust Multimodal Large Language Models

arXiv:2602.22227v3 Announce Type: replace Abstract: Despite their impressive capabilities, Multimodal Large Language Models (MLLMs) exhibit perceptual fragility when confronted with visually complex scenes. This weakness stems from a reliance on finite training datasets, which are prohibitively expensive to scale and…

March 5, 2026

SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety

arXiv:2505.20065v2 Announce Type: replace Abstract: As Large Language Models (LLMs) are increasingly deployed in real-world applications, balancing helpfulness and safety has become a central challenge. A natural approach is to incorporate safety constraints into Reinforcement Learning from Human Feedback (RLHF),…

March 5, 2026

Semi-Supervised Generative Learning via Latent Space Distribution Matching

arXiv:2603.04223v1 Announce Type: cross Abstract: We introduce Latent Space Distribution Matching (LSDM), a novel framework for semi-supervised generative modeling of conditional distributions. LSDM operates in two stages: (i) learning a low-dimensional latent space from both paired and unpaired data, and…

March 5, 2026

List Sample Compression and Uniform Convergence

arXiv:2403.10889v2 Announce Type: replace Abstract: List learning is a variant of supervised classification where the learner outputs multiple plausible labels for each instance rather than just one. We investigate classical principles related to generalization within the context of list learning.…

March 5, 2026

Half the Nonlinearity Is Wasted: Measuring and Reallocating the Transformer’s MLP Budget

arXiv:2603.03459v1 Announce Type: new Abstract: We investigate when transformer MLP nonlinearity is actually necessary. A gate with $d+1$ parameters decides when to replace the full MLP with a linear surrogate. Through systematic investigation across six models (162M-2.8B parameters), two architectures,…

March 5, 2026

Tracing 3D Anatomy in 2D Strokes: A Multi-Stage Projection Driven Approach to Cervical Spine Fracture Identification

arXiv:2601.15235v3 Announce Type: replace-cross Abstract: Cervical spine fractures demand rapid and accurate diagnosis for effective clinical management. This study presents an automated, end-to-end pipeline for fracture detection across cervical vertebrae (C1–C7) that assesses the feasibility of fracture recognition from vertebra-level…

March 5, 2026

A Short Note on a Variant of the Squint Algorithm

arXiv:2603.03409v1 Announce Type: new Abstract: This short note describes a simple variant of the Squint algorithm of Koolen and Van Erven [2015] for the classic expert problem. Via an equally simple modification of their proof, we prove that this variant…

March 5, 2026

[Re] FairDICE: A Gap Between Theory And Practice

arXiv:2603.03454v1 Announce Type: new Abstract: Offline Reinforcement Learning (RL) is an emerging field of RL in which policies are learned solely from demonstrations. Within offline RL, some environments involve balancing multiple objectives, but existing multi-objective offline RL algorithms do not…

March 5, 2026