Archives AI News

MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models

arXiv:2508.02343v2 Announce Type: replace Abstract: Quantization significantly accelerates inference in large language models (LLMs) by replacing original high-precision matrices with low-precision counterparts. Recent advances in weight-activation quantization have primarily focused on mapping both weights and activations to the INT4 format.…

April 1, 2026

Stable Reasoning, Unstable Responses: Mitigating LLM Deception via Stability Asymmetry

arXiv:2603.26846v1 Announce Type: new Abstract: As Large Language Models (LLMs) expand in capability and application scope, their trustworthiness becomes critical. A vital risk is intrinsic deception, wherein models strategically mislead users to achieve their own objectives. Existing alignment approaches based…

April 1, 2026

A Hierarchical Sheaf Spectral Embedding Framework for Single-Cell RNA-seq Analysis

arXiv:2603.26858v1 Announce Type: new Abstract: Single-cell RNA-seq data analysis typically requires representations that capture heterogeneous local structure across multiple scales while remaining stable and interpretable. In this work, we propose a hierarchical sheaf spectral embedding (HSSE) framework that constructs informative…

April 1, 2026

Electricity Price Forecasting: Bridging Linear Models, Neural Networks and Online Learning

arXiv:2601.02856v3 Announce Type: replace Abstract: Precise day-ahead forecasts for electricity prices are crucial to ensure efficient portfolio management, support strategic decision-making for power plant operations, enable efficient battery storage optimization, and facilitate demand response planning. However, developing an accurate prediction…

April 1, 2026

Property-Guided Molecular Generation and Optimization via Latent Flows

arXiv:2603.26889v1 Announce Type: new Abstract: Molecular discovery is increasingly framed as an inverse design problem: identifying molecular structures that satisfy desired property profiles under feasibility constraints. While recent generative models provide continuous latent representations of chemical space, targeted optimization within…

April 1, 2026

Thin Keys, Full Values: Reducing KV Cache via Low-Dimensional Attention Selection

arXiv:2603.04427v4 Announce Type: replace Abstract: Standard Transformer attention uses identical dimensionality for queries, keys, and values, yet these components serve different roles: queries and keys produce scalar attention weights (selection), while values carry rich representations (value transfer). We show that…

April 1, 2026

Strategic Candidacy in Generative AI Arenas

arXiv:2603.26891v1 Announce Type: new Abstract: AI arenas, which rank generative models from pairwise preferences of users, are a popular method for measuring the relative performance of models in the course of their organic use. Because rankings are computed from noisy…

April 1, 2026

Preview tool helps makers visualize 3D-printed objects

By quickly generating aesthetically accurate previews of fabricated objects, the VisiPrint system could make prototyping faster and less wasteful.

April 1, 2026

Empirical Likelihood for Nonsmooth Functionals

arXiv:2603.27743v1 Announce Type: cross Abstract: Empirical likelihood is an attractive inferential framework that respects natural parameter boundaries, but existing approaches typically require smoothness of the functional and miscalibrate substantially when these assumptions are violated. For the optimal-value functional central to…

April 1, 2026

On the Hardness of Reinforcement Learning with Transition Look-Ahead

arXiv:2510.19372v2 Announce Type: replace-cross Abstract: We study reinforcement learning (RL) with transition look-ahead, where the agent may observe which states would be visited upon playing any sequence of $ell$ actions before deciding its course of action. While such predictive information…

April 1, 2026