Archives AI News

Discrete State Diffusion Models: A Sample Complexity Perspective

arXiv:2510.10854v2 Announce Type: replace Abstract: Diffusion models have demonstrated remarkable performance in generating high-dimensional samples across domains such as vision, language, and the sciences. Although continuous-state diffusion models have been extensively studied both empirically and theoretically, discrete-state diffusion models, essential…

NeuroPareto: Calibrated Acquisition for Costly Many-Goal Search in Vast Parameter Spaces

arXiv:2602.03901v2 Announce Type: replace Abstract: The pursuit of optimal trade-offs in high-dimensional search spaces under stringent computational constraints poses a fundamental challenge for contemporary multi-objective optimization. We develop NeuroPareto, a cohesive architecture that integrates rank-centric filtering, uncertainty disentanglement, and history-conditioned…

Singular Vectors of Attention Heads Align with Features

arXiv:2602.13524v1 Announce Type: new Abstract: Identifying feature representations in language models is a central task in mechanistic interpretability. Several recent studies have made an implicit assumption that feature representations can be inferred in some cases from singular vectors of attention…

Guaranteed Nonconvex Low-Rank Tensor Estimation via Scaled Gradient Descent

arXiv:2501.01696v2 Announce Type: replace-cross Abstract: Tensors, which give a faithful and effective representation to deliver the intrinsic structure of multi-dimensional data, play a crucial role in an increasing number of signal processing and machine learning problems. However, tensor data are…

QuaRK: A Quantum Reservoir Kernel for Time Series Learning

arXiv:2602.13531v1 Announce Type: new Abstract: Quantum reservoir computing offers a promising route for time series learning by modelling sequential data via rich quantum dynamics while the only training required happens at the level of a lightweight classical readout. However, studies…

Out-of-Support Generalisation via Weight Space Sequence Modelling

arXiv:2602.13550v1 Announce Type: new Abstract: As breakthroughs in deep learning transform key industries, models are increasingly required to extrapolate on datapoints found outside the range of the training set, a challenge we coin as out-of-support (OoS) generalisation. However, neural networks…