Archives AI News

From Bits to Chips: An LLM-based Hardware-Aware Quantization Agent for Streamlined Deployment of LLMs

arXiv:2601.03484v1 Announce Type: new Abstract: Deploying models, especially large language models (LLMs), is becoming increasingly attractive to a broader user base, including those without specialized expertise. However, due to the resource constraints of certain hardware, maintaining high accuracy with larger…

January 8, 2026

Transolver is a Linear Transformer: Revisiting Physics-Attention through the Lens of Linear Attention

arXiv:2511.06294v3 Announce Type: replace Abstract: Recent advances in Transformer-based Neural Operators have enabled significant progress in data-driven solvers for Partial Differential Equations (PDEs). Most current research has focused on reducing the quadratic complexity of attention to address the resulting low…

January 8, 2026

VeRPO: Verifiable Dense Reward Policy Optimization for Code Generation

arXiv:2601.03525v1 Announce Type: new Abstract: Effective reward design is a central challenge in Reinforcement Learning (RL) for code generation. Mainstream pass/fail outcome rewards enforce functional correctness via executing unit tests, but the resulting sparsity limits potential performance gains. While recent…

January 8, 2026

Improving Underwater Acoustic Classification Through Learnable Gabor Filter Convolution and Attention Mechanisms

arXiv:2512.14714v2 Announce Type: replace Abstract: Remotely detecting and classifying underwater acoustic targets is critical for environmental monitoring and defence. However, the complexity of ship-radiated and environmental noise poses significant challenges for accurate signal processing. While recent advancements in machine learning…

January 8, 2026

Data relativistic uncertainty framework for low-illumination anime scenery image enhancement

arXiv:2512.21944v2 Announce Type: replace-cross Abstract: By contrast with the prevailing works of low-light enhancement in natural images and videos, this study copes with the low-illumination quality degradation in anime scenery images to bridge the domain gap. For such an underexplored…

January 8, 2026

Beyond Physical Labels: Redefining Domains for Robust WiFi-based Gesture Recognition

arXiv:2601.03825v1 Announce Type: cross Abstract: In this paper, we propose GesFi, a novel WiFi-based gesture recognition system that introduces WiFi latent domain mining to redefine domains directly from the data itself. GesFi first processes raw sensing data collected from WiFi…

January 8, 2026

SQL2Circuits: Estimating Cardinalities, Execution Times, and Costs for SQL Queries with Quantum Natural Language Processing

arXiv:2306.08529v3 Announce Type: replace-cross Abstract: Recent advances in quantum computing have led to progress in exploring quantum applications across diverse fields, including databases and data management. This work presents a quantum machine learning model that tackles the challenge of estimating…

January 8, 2026

HONEYBEE: Efficient Role-based Access Control for Vector Databases via Dynamic Partitioning [Technical Report]

arXiv:2505.01538v3 Announce Type: replace-cross Abstract: Enterprise deployments of vector databases require access control policies to protect sensitive data. These systems often implement access control through hybrid vector queries that combine nearest-neighbor search with relational predicates based on user permissions. However,…

January 8, 2026

Low Resource Reconstruction Attacks Through Benign Prompts

arXiv:2507.07947v3 Announce Type: replace Abstract: Recent advances in generative models, such as diffusion models, have raised concerns related to privacy, copyright infringement, and data stewardship. To better understand and control these risks, prior work has introduced techniques and attacks that…

January 8, 2026

The Mean-Field Dynamics of Transformers

arXiv:2512.01868v3 Announce Type: replace Abstract: We develop a mathematical framework that interprets Transformer attention as an interacting particle system and studies its continuum (mean-field) limits. By idealizing attention on the sphere, we connect Transformer dynamics to Wasserstein gradient flows, synchronization…

January 8, 2026