Archives AI News

Multi-Token Prediction via Self-Distillation

arXiv:2602.06019v2 Announce Type: replace-cross Abstract: Existing techniques for accelerating language model inference, such as speculative decoding, require training auxiliary speculator models and building and deploying complex inference pipelines. We consider a new approach for converting a pretrained autoregressive language model…

April 27, 2026

Optimal sequential decision-making for error propagation mitigation in digital twins

arXiv:2604.22168v1 Announce Type: new Abstract: Here, we explore the problem of error propagation mitigation in modular digital twins as a sequential decision process. Building on a companion study that used a Hidden Markov Model (HMM) to infer latent error regimes…

April 27, 2026

Segmentation of Gray Matters and White Matters from Brain MRI data

arXiv:2603.29171v3 Announce Type: replace-cross Abstract: Accurate segmentation of brain tissues such as gray matter and white matter from magnetic resonance imaging is essential for studying brain anatomy, diagnosing neurological disorders, and monitoring disease progression. Traditional methods, such as FSL FAST,…

April 27, 2026

ReCast: Recasting Learning Signals for Reinforcement Learning in Generative Recommendation

arXiv:2604.22169v1 Announce Type: new Abstract: Generic group-based RL assumes that sampled rollout groups are already usable learning signals. We show that this assumption breaks down in sparse-hit generative recommendation, where many sampled groups never become learnable at all. We propose…

April 27, 2026

Useful nonrobust features are ubiquitous in biomedical images

arXiv:2604.22579v1 Announce Type: cross Abstract: We study whether deep networks for medical imaging learn useful nonrobust features – predictive input patterns that are not human interpretable and highly susceptible to small adversarial perturbations – and how these features impact test…

April 27, 2026

Sharpness-Aware Poisoning: Enhancing Transferability of Injective Attacks on Recommender Systems

arXiv:2604.22170v1 Announce Type: new Abstract: Recommender Systems (RS) have been shown to be vulnerable to injective attacks, where attackers inject limited fake user profiles to promote the exposure of target items to real users for unethical gains (e.g., economic or political…

April 27, 2026

Mixed Membership sub-Gaussian Models

arXiv:2604.22633v1 Announce Type: cross Abstract: The Gaussian mixture model is widely used in unsupervised learning, owing to its simplicity and interpretability. However, a fundamental limitation of the classical Gaussian mixture model is that it forces each observation to belong to…

April 27, 2026

A faster way to estimate AI power consumption

The “EnergAIzer” method generates reliable results in seconds, enabling data center operators to efficiently allocate resources and reduce wasted energy.

April 27, 2026

LLMPhy: Parameter-Identifiable Physical Reasoning Combining Large Language Models and Physics Engines

arXiv:2411.08027v3 Announce Type: replace Abstract: Most learning-based approaches to complex physical reasoning sidestep the crucial problem of parameter identification (e.g., mass, friction) that governs scene dynamics, despite its importance in real-world applications such as collision avoidance and robotic manipulation. In…

April 27, 2026

Presenting DiaData for Research on Type 1 Diabetes

arXiv:2508.09160v2 Announce Type: replace Abstract: Type 1 diabetes (T1D) is an autoimmune disorder that leads to the destruction of insulin-producing cells, resulting in insulin deficiency, which is why affected individuals depend on external insulin injections. However, insulin can decrease…

April 27, 2026