Archives AI News

Rudder: Steering Prefetching in Distributed GNN Training using LLM Agents

arXiv:2602.23556v1 Announce Type: new Abstract: Large-scale Graph Neural Networks (GNNs) are typically trained by sampling a vertex’s neighbors to a fixed distance. Because large input graphs are distributed, training requires frequent irregular communication that stalls forward progress. Moreover, fetched data…

Test-Time Training with KV Binding Is Secretly Linear Attention

arXiv:2602.21204v2 Announce Type: replace Abstract: Test-time training (TTT) with KV binding as sequence modeling layer is commonly interpreted as a form of online meta-learning that memorizes a key-value mapping at test time. However, our analysis reveals multiple phenomena that contradict…

Flowette: Flow Matching with Graphette Priors for Graph Generation

arXiv:2602.23566v1 Announce Type: new Abstract: We study generative modeling of graphs with recurring subgraph motifs. We propose Flowette, a continuous flow matching framework, that employs a graph neural network based transformer to learn a velocity field defined over graph representations…

FinBloom: Knowledge Grounding Large Language Model with Real-time Financial Data

arXiv:2502.18471v2 Announce Type: replace-cross Abstract: Large language models (LLMs) excel at generating human-like responses but often struggle with interactive tasks that require access to real-time information. This limitation poses challenges in finance, where models must access up-to-date information, such as…

Hybrid Quantum Temporal Convolutional Networks

arXiv:2602.23578v1 Announce Type: new Abstract: Quantum machine learning models for sequential data face scalability challenges with complex multivariate signals. We introduce the Hybrid Quantum Temporal Convolutional Network (HQTCN), which combines classical temporal windowing with a quantum convolutional neural network core.…

SDMixer: Sparse Dual-Mixer for Time Series Forecasting

arXiv:2602.23581v1 Announce Type: new Abstract: Multivariate time series forecasting is widely applied in fields such as transportation, energy, and finance. However, the data commonly suffers from issues of multi-scale characteristics, weak correlations, and noise interference, which limit the predictive performance…

Learning to Optimize by Differentiable Programming

arXiv:2601.16510v2 Announce Type: replace-cross Abstract: Solving massive-scale optimization problems requires scalable first-order methods with low per-iteration cost. This tutorial highlights a shift in optimization: using differentiable programming not only to execute algorithms but to learn how to design them. Modern…