Archives AI News

UniComp: A Unified Evaluation of Large Language Model Compression via Pruning, Quantization and Distillation

arXiv:2602.09130v4 Announce Type: replace Abstract: Model compression is increasingly essential for deploying large language models (LLMs), yet existing comparative studies largely focus on pruning and quantization evaluated primarily on knowledge-centric benchmarks. Thus, we introduce UniComp, a unified evaluation framework for…

May 7, 2026

Continual Distillation of Teachers from Different Domains

arXiv:2605.04059v1 Announce Type: new Abstract: Deep learning models continue to scale, with some requiring more storage than many large-scale datasets. Thus, we introduce a new paradigm: Continual Distillation (CD), where a student learns sequentially from a stream of teacher models…

May 7, 2026

Lookahead Drifting Model

arXiv:2605.04060v1 Announce Type: new Abstract: Recently, a new paradigm named emph{drifting model} has been proposed for mapping distributions, which achieves the SOTA image generation performance over ImageNet via one-step neural functional evaluation (NFE). The basic idea is to compute a…

May 7, 2026

MP-ISMoE: Mixed-Precision Interactive Side Mixture-of-Experts for Efficient Transfer Learning

arXiv:2605.04058v1 Announce Type: new Abstract: Parameter-efficient transfer learning (PETL) has emerged as a pivotal paradigm for adapting pre-trained foundation models to downstream tasks, significantly reducing trainable parameters yet suffering from substantial memory overhead caused by gradient backpropagation during fine-tuning. While…

May 7, 2026

Structured Progressive Knowledge Activation for LLM-Driven Neural Architecture Search

arXiv:2605.04057v1 Announce Type: new Abstract: This paper focuses on a key challenge in Neural Architecture Search (NAS): integrating established architectural knowledge while exploring new designs under expensive evaluations. Large language models (LLMs) are a promising assistant for NAS because they…

May 7, 2026

Transformation Categorization Based on Group Decomposition Theory Using Parameter Division

arXiv:2605.04056v1 Announce Type: new Abstract: Representation learning seeks meaningful sensory representations without supervision and can model aspects of human development. Although many neural networks empirically learn useful features, a principled account of what makes a representation “good” remains elusive. We…

May 7, 2026

Bayesian Parameter Shift Rule in Variational Quantum Eigensolvers

arXiv:2502.02625v2 Announce Type: replace Abstract: Parameter shift rules (PSRs) are key techniques for efficient gradient estimation in variational quantum eigensolvers (VQEs). In this paper, we propose its Bayesian variant, where Gaussian processes with appropriate kernels are used to estimate the…

May 7, 2026

EdgeRazor: A Lightweight Framework for Large Language Models via Mixed-Precision Quantization-Aware Distillation

arXiv:2605.04062v1 Announce Type: new Abstract: Recent years have witnessed an increasing interest in deploying LLMs on resource-constrained devices, among which quantization has emerged as a promising lightweight technique that converts full-precision model weights and activations into lower-bit formats. Existing weight…

May 7, 2026

Learning to Orchestrate Agents in Natural Language with the Conductor

arXiv:2512.04388v5 Announce Type: replace Abstract: Powerful large language models (LLMs) from different providers have been expensively trained and finetuned to specialize across varying domains. In this work, we introduce a new kind of Conductor model trained with reinforcement learning to…

May 7, 2026

Investigating Trustworthiness of Nonparametric Deep Survival Models for Alzheimer’s Disease Progression Analysis

arXiv:2605.04063v1 Announce Type: new Abstract: Alzheimer’s Dementia (AD) is a progressive neurodegenerative disease marked by irreversible decline, making reliable modeling of its progression essential for effective patient care. Progression-aware methods such as survival analysis are therefore crucial tools for the…

May 7, 2026