Archives AI News

MixLLM: LLM Quantization with Global Mixed-precision between Output-features and Highly-efficient System Design

arXiv:2412.14590v2 Announce Type: replace Abstract: Quantization has become one of the most effective methodologies to compress LLMs into smaller size. However, the existing quantization solutions still show limitations of either non-negligible accuracy drop or low system efficiency. In this paper,…

April 23, 2026

On the Quantization Robustness of Diffusion Language Models in Coding Benchmarks

arXiv:2604.20079v1 Announce Type: new Abstract: Auto-regressive Large Language Models (LLMs) achieve strong performance on coding tasks, but incur high memory and inference costs. Diffusion-based language models (d-LLMs) offer bounded inference cost via iterative denoising, but their behavior under post-training quantization…

April 23, 2026

Optimal Single-Policy Sample Complexity and Transient Coverage for Average-Reward Offline RL

arXiv:2506.20904v2 Announce Type: replace Abstract: We study offline reinforcement learning in average-reward MDPs, which presents increased challenges from the perspectives of distribution shift and non-uniform coverage, and has been relatively underexamined from a theoretical perspective. While previous work obtains performance…

April 23, 2026

Concept Graph Convolutions: Message Passing in the Concept Space

arXiv:2604.20082v1 Announce Type: new Abstract: The trust in the predictions of Graph Neural Networks is limited by their opaque reasoning process. Prior methods have tried to explain graph networks via concept-based explanations extracted from the latent representations obtained after message…

April 23, 2026

Evaluating the Quality of the Quantified Uncertainty for (Re)Calibration of Data-Driven Regression Models

arXiv:2508.17761v3 Announce Type: replace Abstract: In safety-critical applications data-driven models must not only be accurate but also provide reliable uncertainty estimates. This property, commonly referred to as calibration, is essential for risk-aware decision-making. In regression a wide variety of calibration…

April 23, 2026

Gauge-covariant stochastic neural fields: Stability and finite-width effects

arXiv:2508.18948v2 Announce Type: replace-cross Abstract: We develop a gauge-covariant stochastic effective field theory for stability and finite-width effects in deep neural systems. The model uses classical commuting fields: a complex matter field, a real Abelian connection field, and a fictitious…

April 23, 2026

New chip can protect wireless biomedical devices from quantum attacks

Ultra-efficient chip design enables extremely strong cryptography algorithms to run on energy-constrained edge devices.

April 23, 2026