Archives AI News

PTQTP: Post-Training Quantization to Trit-Planes for Large Language Models

arXiv:2509.16989v3 Announce Type: replace Abstract: Post-training quantization (PTQ) of large language models (LLMs) to extremely low bit-widths remains challenging due to the fundamental trade-off between computational efficiency and representational capacity. While existing ultra-low-bit methods rely on binary approximations or quantization-aware…
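The abstract is truncated, but the core idea of ternary ("trit") post-training quantization can be made concrete. The sketch below is a generic ternary PTQ baseline with per-row scales, not the paper's trit-plane decomposition; the `sparsity` threshold and the closed-form scale are illustrative choices.

```python
import numpy as np

def ternary_quantize(w: np.ndarray, sparsity: float = 0.3):
    """Quantize a weight matrix to trits {-1, 0, +1} with per-row scales.

    Entries below each row's `sparsity` magnitude quantile are zeroed,
    the rest keep their sign, and a closed-form per-row scale minimizes
    ||w - a * t||^2 for the fixed trit pattern t.
    """
    thresh = np.quantile(np.abs(w), sparsity, axis=1, keepdims=True)
    trits = np.sign(w) * (np.abs(w) >= thresh)                 # {-1, 0, +1}
    denom = np.maximum((trits * trits).sum(axis=1, keepdims=True), 1.0)
    scales = (w * trits).sum(axis=1, keepdims=True) / denom    # a = <w,t>/<t,t>
    return scales, trits.astype(np.int8)

w = np.random.randn(4, 16).astype(np.float32)
scales, trits = ternary_quantize(w)
print("reconstruction MSE:", np.mean((w - scales * trits) ** 2))
```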

Causality-Inspired Safe Residual Correction for Multivariate Time Series

arXiv:2512.22428v2 Announce Type: replace Abstract: While modern multivariate forecasters such as Transformers and GNNs achieve strong benchmark performance, they often suffer from systematic errors at specific variables or horizons and, critically, lack guarantees against performance degradation in deployment. Existing post-hoc…
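To make the post-hoc setup the abstract alludes to concrete: the sketch below fits a residual corrector on a base forecaster's errors and gates it per variable, applying the correction only where it reduces held-out error. The `Ridge` corrector and the gating rule are assumptions for illustration, not the paper's causality-inspired mechanism; in practice the gate would be chosen on a split separate from final evaluation.

```python
import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.default_rng(0)
T, V = 400, 3
y = rng.normal(size=(T, V)).cumsum(axis=0)                 # toy multivariate series
base_pred = y + rng.normal(scale=0.5, size=(T, V)) + 0.3   # biased base forecaster

split = T // 2
resid = y - base_pred
corrector = Ridge().fit(base_pred[:split], resid[:split])  # learn systematic error

corr = corrector.predict(base_pred[split:])
err_base = ((y[split:] - base_pred[split:]) ** 2).mean(axis=0)
err_corr = ((y[split:] - base_pred[split:] - corr) ** 2).mean(axis=0)
gate = err_corr < err_base            # keep the base forecast where correction hurts
final = base_pred[split:] + corr * gate
print("gate per variable:", gate)
print("MSE base:", err_base, " MSE gated:", ((y[split:] - final) ** 2).mean(axis=0))
```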

MCD: Marginal Contrastive Discrimination for conditional density estimation

arXiv:2206.01592v2 Announce Type: replace-cross Abstract: We consider the problem of conditional density estimation, a major topic of interest in statistics and machine learning. Our method, Marginal Contrastive Discrimination (MCD), reformulates the conditional density function…
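The contrastive reformulation the abstract hints at can be sketched with standard tools: a classifier trained to separate joint samples (x, y) from pairs whose y values are shuffled across rows estimates the density ratio p(x, y) / (p(x) p(y)); combined with a model of the marginal p(y), this yields the conditional density. The snippet below is a generic version of that trick, not MCD's exact construction.

```python
import numpy as np
from sklearn.neural_network import MLPClassifier

rng = np.random.default_rng(1)
n = 4000
x = rng.normal(size=(n, 1))
y = 2.0 * x + rng.normal(scale=0.5, size=(n, 1))   # y | x ~ N(2x, 0.25)

joint = np.hstack([x, y])                          # samples from p(x, y)
shuffled = np.hstack([x, rng.permutation(y)])      # samples from p(x) p(y)
X = np.vstack([joint, shuffled])
labels = np.r_[np.ones(n), np.zeros(n)]

clf = MLPClassifier(hidden_layer_sizes=(64,), max_iter=500).fit(X, labels)
p = clf.predict_proba(np.array([[1.0, 2.0]]))[0, 1]
ratio = p / (1 - p)                                # ≈ p(x, y) / (p(x) p(y))
print("density ratio at (x=1, y=2):", ratio)       # large, since y ≈ 2x is likely
```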

Can Optimal Transport Improve Federated Inverse Reinforcement Learning?

arXiv:2601.00309v1 Announce Type: new Abstract: In robotics and multi-agent systems, fleets of autonomous agents often operate in subtly different environments while pursuing a common high-level objective. Directly pooling their data to learn a shared reward function is typically impractical due…

Mitigating optimistic bias in entropic risk estimation and optimization

arXiv:2409.19926v4 Announce Type: replace-cross Abstract: The entropic risk measure is widely used in high-stakes decision-making across economics, management science, finance, and safety-critical control systems because it captures tail risks associated with uncertain losses. However, when data are limited, the empirical…
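For reference, the entropic risk of a loss L at risk-aversion θ is ρ_θ(L) = (1/θ) log E[exp(θL)], and its plug-in estimate from n samples is downward-biased because log is concave (Jensen's inequality). The Monte Carlo check below uses Gaussian losses, for which ρ_θ(L) = θ/2 exactly, to make the optimistic bias visible; the distribution and sample sizes are illustrative only, not the paper's setup.

```python
import numpy as np

def entropic_risk(losses: np.ndarray, theta: float) -> float:
    """rho_theta(L) = (1/theta) * log(mean(exp(theta * L))), via log-sum-exp."""
    z = theta * losses
    m = z.max()
    return (m + np.log(np.mean(np.exp(z - m)))) / theta

rng = np.random.default_rng(2)
theta, n, trials = 1.0, 20, 20_000
true_rho = theta / 2                               # exact value for L ~ N(0, 1)
estimates = [entropic_risk(rng.normal(size=n), theta) for _ in range(trials)]
print(f"true: {true_rho:.3f}  mean plug-in estimate: {np.mean(estimates):.3f}")
# The mean estimate lands noticeably below 0.5: the optimistic bias in question.
```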

The Curse of Depth in Large Language Models

arXiv:2502.05795v3 Announce Type: replace Abstract: In this paper, we introduce the Curse of Depth, a concept that highlights, explains, and addresses the recent observation that in modern Large Language Models (LLMs) nearly half of the layers are less effective than…
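One way to see why deeper layers might contribute less: in a Pre-LN residual stack x_{l+1} = x_l + f(LN(x_l)), each block adds roughly unit-variance output to a stream whose variance already grows like l, so block l's relative update shrinks like 1/√l. The toy simulation below demonstrates that variance argument with random linear blocks; it is a sketch of the phenomenon, not the paper's analysis or remedy.

```python
import numpy as np

rng = np.random.default_rng(3)
d, depth = 256, 48
x = rng.normal(size=d)
for layer in range(1, depth + 1):
    ln = (x - x.mean()) / x.std()                # LayerNorm without affine params
    W = rng.normal(size=(d, d)) / np.sqrt(d)     # stand-in for an attention/MLP block
    update = W @ ln                              # roughly unit-variance output
    if layer in (1, 8, 24, 48):
        rel = np.linalg.norm(update) / np.linalg.norm(x)
        print(f"layer {layer:2d}: relative update size = {rel:.3f}")
    x = x + update
# The printed ratios decay roughly like 1 / sqrt(layer).
```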

Flattening Hierarchies with Policy Bootstrapping

arXiv:2505.14975v3 Announce Type: replace Abstract: Offline goal-conditioned reinforcement learning (GCRL) is a promising approach for pretraining generalist policies on large datasets of reward-free trajectories, akin to the self-supervised objectives used to train foundation models for computer vision and natural language…
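The self-supervised objective the abstract compares to can be sketched via hindsight relabeling, the standard way to turn reward-free trajectories into goal-conditioned supervision: any later state in a trajectory is relabeled as the goal the agent was implicitly reaching for. The helper below is a generic illustration, not the paper's policy-bootstrapping scheme.

```python
import numpy as np

rng = np.random.default_rng(4)

def hindsight_relabel(states, actions, num_samples=64):
    """Sample (s_t, a_t, g) triples with g = s_k for a random future k > t."""
    T = len(actions)
    t = rng.integers(0, T, size=num_samples)     # random time steps
    k = rng.integers(t + 1, T + 1)               # future index in (t, T]
    return states[t], actions[t], states[k]

states = rng.normal(size=(100, 4))               # toy trajectory, 4-dim states
actions = rng.normal(size=(99, 2))
s, a, g = hindsight_relabel(states, actions)
print(s.shape, a.shape, g.shape)                 # (64, 4) (64, 2) (64, 4)
```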