LiftQuant: Continuous Bit-Width LLM via Dimensional Lifting and Projection
arXiv:2606.04050v1 Announce Type: new Abstract: Existing quantization methods are fundamentally limited by rigid, integer-based bit-widths (e.g., 2, 3-bit), resulting in a “deployment gap” where Large Language Models cannot be optimally fitted to specific memory budgets. To bridge this gap, we…
