Archives AI News

Rudder: Steering Prefetching in Distributed GNN Training using LLM Agents

arXiv:2602.23556v1 Announce Type: new Abstract: Large-scale Graph Neural Networks (GNNs) are typically trained by sampling a vertex’s neighbors to a fixed distance. Because large input graphs are distributed, training requires frequent irregular communication that stalls forward progress. Moreover, fetched data…

March 2, 2026

Convex Loss Functions for Support Vector Machines (SVMs) and Neural Networks

arXiv:2601.21331v4 Announce Type: replace Abstract: We propose a new convex loss for Support Vector Machines, both for the binary classification and for the regression models. Therefore, we show the mathematical derivation of the dual problems and we experiment with them…

March 2, 2026

Dynamics of Learning under User Choice: Overspecialization and Peer-Model Probing

arXiv:2602.23565v1 Announce Type: new Abstract: In many economically relevant contexts where machine learning is deployed, multiple platforms obtain data from the same pool of users, each of whom selects the platform that best serves them. Prior work in this setting…

March 2, 2026

Test-Time Training with KV Binding Is Secretly Linear Attention

arXiv:2602.21204v2 Announce Type: replace Abstract: Test-time training (TTT) with KV binding as sequence modeling layer is commonly interpreted as a form of online meta-learning that memorizes a key-value mapping at test time. However, our analysis reveals multiple phenomena that contradict…

March 2, 2026

Flowette: Flow Matching with Graphette Priors for Graph Generation

arXiv:2602.23566v1 Announce Type: new Abstract: We study generative modeling of graphs with recurring subgraph motifs. We propose Flowette, a continuous flow matching framework, that employs a graph neural network based transformer to learn a velocity field defined over graph representations…

March 2, 2026

FinBloom: Knowledge Grounding Large Language Model with Real-time Financial Data

arXiv:2502.18471v2 Announce Type: replace-cross Abstract: Large language models (LLMs) excel at generating human-like responses but often struggle with interactive tasks that require access to real-time information. This limitation poses challenges in finance, where models must access up-to-date information, such as…

March 2, 2026

Hybrid Quantum Temporal Convolutional Networks

arXiv:2602.23578v1 Announce Type: new Abstract: Quantum machine learning models for sequential data face scalability challenges with complex multivariate signals. We introduce the Hybrid Quantum Temporal Convolutional Network (HQTCN), which combines classical temporal windowing with a quantum convolutional neural network core.…

March 2, 2026

Alignment through Meta-Weighted Online Sampling: Bridging the Gap between Data Generation and Preference Optimization

arXiv:2509.23371v2 Announce Type: replace-cross Abstract: Preference optimization is crucial for aligning large language models (LLMs) with human values and intentions. A significant challenge in this process is the distribution mismatch between pre-collected offline preference data and the evolving model policy.…

March 2, 2026

SDMixer: Sparse Dual-Mixer for Time Series Forecasting

arXiv:2602.23581v1 Announce Type: new Abstract: Multivariate time series forecasting is widely applied in fields such as transportation, energy, and finance. However, the data commonly suffers from issues of multi-scale characteristics, weak correlations, and noise interference, which limit the predictive performance…

March 2, 2026

Learning to Optimize by Differentiable Programming

arXiv:2601.16510v2 Announce Type: replace-cross Abstract: Solving massive-scale optimization problems requires scalable first-order methods with low per-iteration cost. This tutorial highlights a shift in optimization: using differentiable programming not only to execute algorithms but to learn how to design them. Modern…

March 2, 2026