Archives AI News

Pruning Deep Neural Networks via the Marchenko–Pastur Distribution

arXiv:2606.02608v1 Announce Type: new Abstract: We study a Marchenko–Pastur (MP) random-matrix approach to pruning deep neural networks with very small post-pruning fine-tuning budgets. The main practical contribution is accuracy retention under short calibration and fine-tuning schedules, rather than a long…

June 3, 2026

Forecasting Conceptual Diffusion in Science: The Case of Quantum Computing

arXiv:2606.03919v1 Announce Type: cross Abstract: Understanding and anticipating scientific change requires models that distinguish between endogenous consolidation and exogenous diffusion of scientific concepts. Using the quantum computing subtree of concepts in OpenAlex, we construct a temporally resolved concept co-occurrence network…

June 3, 2026

Minimax Optimal Strategy for Delayed Observations in Online Reinforcement Learning

arXiv:2603.03480v2 Announce Type: replace Abstract: We study reinforcement learning with delayed state observation, where the agent observes the current state after some random number of time steps. We propose an algorithm that combines the augmentation method and the upper confidence…

June 3, 2026

Backdooring Masked Diffusion Language Models

arXiv:2605.19262v2 Announce Type: replace Abstract: Masked diffusion language models (MDLMs) are emerging as a compelling new paradigm for text generation, but their training-time security remains largely unexplored. Existing backdoor attacks on Gaussian diffusion models or autoregressive language models do not…

June 3, 2026

RogueMerge: Robust and Unified Attacks against LLM Model Merging

arXiv:2606.03344v1 Announce Type: cross Abstract: Model merging composes specialized capabilities into a single LLM by aggregating task vectors sourced from unverified public platforms, exposing a critical supply-chain attack surface: Because any malicious behavior can be encoded into a task vector,…

June 3, 2026

Building Better Activation Oracles

arXiv:2606.02609v1 Announce Type: new Abstract: Activation Oracles (AOs) are promising methods for interpreting residual stream activations. However, current AOs face important issues, such as hallucinations and vagueness. Additionally, text-inversion confounds make them hard to evaluate. To this end, we improve…

June 3, 2026

Cross-Modal Contrastive Learning of ECG and Angiography Representations for Severe Stenosis Classification

arXiv:2606.02605v1 Announce Type: new Abstract: Coronary artery stenosis is a common cardiovascular disease, with severe, untreated cases posing significant risks of heart attack. Although coronary (X-ray) angiograms remain the standard for stenosis diagnosis, they are invasive, time- and resource-intensive, and…

June 3, 2026

Resource-Constrained Adaptive Inference for Sequential Pricing

arXiv:2606.03736v1 Announce Type: cross Abstract: Resource-constrained pricing controllers can make fixed-price inference impossible: the controller’s resource state may remove the target price neighborhood from the feasible set, even when every realized action has a known positive density. We formalize this…

June 3, 2026

LeAP: Learnable Adaptive Permutation for Feature Selection in Heterogeneous and Sparse Recommender Systems

arXiv:2606.01111v2 Announce Type: replace Abstract: Modern industrial recommender systems rely on thousands of heterogeneous features — ranging from low-dimensional scalars (e.g., statistical value) to high-dimensional embeddings (e.g., user-id embeddings, MLP representations) — to achieve high-precision predictions. Given the immense computational…

June 3, 2026

ReLoRA: Knowledge-Reusing Adaptation for Fast Rollout of Evolving LLM Services

arXiv:2606.02606v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly deployed as continuously evolving services, where frequent base-model updates may invalidate previously deployed task-specific Low-Rank Adaptation (LoRA) adapters. For service providers managing numerous downstream model services, retraining each LoRA…

June 3, 2026