Archives AI News

Singular Vectors of Attention Heads Align with Features

arXiv:2602.13524v1 Announce Type: new Abstract: Identifying feature representations in language models is a central task in mechanistic interpretability. Several recent studies have made an implicit assumption that feature representations can be inferred in some cases from singular vectors of attention…

Guaranteed Nonconvex Low-Rank Tensor Estimation via Scaled Gradient Descent

arXiv:2501.01696v2 Announce Type: replace-cross Abstract: Tensors, which give a faithful and effective representation to deliver the intrinsic structure of multi-dimensional data, play a crucial role in an increasing number of signal processing and machine learning problems. However, tensor data are…

QuaRK: A Quantum Reservoir Kernel for Time Series Learning

arXiv:2602.13531v1 Announce Type: new Abstract: Quantum reservoir computing offers a promising route for time series learning by modelling sequential data via rich quantum dynamics while the only training required happens at the level of a lightweight classical readout. However, studies…

Out-of-Support Generalisation via Weight Space Sequence Modelling

arXiv:2602.13550v1 Announce Type: new Abstract: As breakthroughs in deep learning transform key industries, models are increasingly required to extrapolate on datapoints found outside the range of the training set, a challenge we coin as out-of-support (OoS) generalisation. However, neural networks…

Quantum Reservoir Computing with Neutral Atoms on a Small, Complex, Medical Dataset

arXiv:2602.14641v1 Announce Type: cross Abstract: Biomarker-based prediction of clinical outcomes is challenging due to nonlinear relationships, correlated features, and the limited size of many medical datasets. Classical machine-learning methods can struggle under these conditions, motivating the search for alternatives. In…

Scenario-Adaptive MU-MIMO OFDM Semantic Communication With Asymmetric Neural Network

arXiv:2602.13557v1 Announce Type: new Abstract: Semantic Communication (SemCom) has emerged as a promising paradigm for 6G networks, aiming to extract and transmit task-relevant information rather than minimizing bit errors. However, applying SemCom to realistic downlink Multi-User Multi-Input Multi-Output (MU-MIMO) Orthogonal…