Archives AI News

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

arXiv:2508.05629v3 Announce Type: replace Abstract: In this work, we present a simple yet theoretically motivated improvement to Supervised Fine-Tuning (SFT) for the Large Language Model (LLM), addressing its limited generalization compared to reinforcement learning (RL). Through mathematical analysis, we reveal…

March 2, 2026

Active Value Querying to Minimize Additive Error in Subadditive Set Function Learning

arXiv:2602.23529v1 Announce Type: new Abstract: Subadditive set functions play a pivotal role in computational economics (especially in combinatorial auctions), combinatorial optimization or artificial intelligence applications such as interpretable machine learning. However, specifying a set function requires assigning values to an…

March 2, 2026

The False Promise of Zero-Shot Super-Resolution in Machine-Learned Operators

arXiv:2510.06646v2 Announce Type: replace Abstract: A core challenge in scientific machine learning, and scientific computing more generally, is modeling continuous phenomena which (in practice) are represented discretely. Machine-learned operators (MLOs) have been introduced as a means to achieve this modeling…

March 2, 2026

Rudder: Steering Prefetching in Distributed GNN Training using LLM Agents

arXiv:2602.23556v1 Announce Type: new Abstract: Large-scale Graph Neural Networks (GNNs) are typically trained by sampling a vertex’s neighbors to a fixed distance. Because large input graphs are distributed, training requires frequent irregular communication that stalls forward progress. Moreover, fetched data…

March 2, 2026

Convex Loss Functions for Support Vector Machines (SVMs) and Neural Networks

arXiv:2601.21331v4 Announce Type: replace Abstract: We propose a new convex loss for Support Vector Machines, both for the binary classification and for the regression models. Therefore, we show the mathematical derivation of the dual problems and we experiment with them…

March 2, 2026

Dynamics of Learning under User Choice: Overspecialization and Peer-Model Probing

arXiv:2602.23565v1 Announce Type: new Abstract: In many economically relevant contexts where machine learning is deployed, multiple platforms obtain data from the same pool of users, each of whom selects the platform that best serves them. Prior work in this setting…

March 2, 2026

Test-Time Training with KV Binding Is Secretly Linear Attention

arXiv:2602.21204v2 Announce Type: replace Abstract: Test-time training (TTT) with KV binding as sequence modeling layer is commonly interpreted as a form of online meta-learning that memorizes a key-value mapping at test time. However, our analysis reveals multiple phenomena that contradict…

March 2, 2026

Flowette: Flow Matching with Graphette Priors for Graph Generation

arXiv:2602.23566v1 Announce Type: new Abstract: We study generative modeling of graphs with recurring subgraph motifs. We propose Flowette, a continuous flow matching framework, that employs a graph neural network based transformer to learn a velocity field defined over graph representations…

March 2, 2026

FinBloom: Knowledge Grounding Large Language Model with Real-time Financial Data

arXiv:2502.18471v2 Announce Type: replace-cross Abstract: Large language models (LLMs) excel at generating human-like responses but often struggle with interactive tasks that require access to real-time information. This limitation poses challenges in finance, where models must access up-to-date information, such as…

March 2, 2026

Hybrid Quantum Temporal Convolutional Networks

arXiv:2602.23578v1 Announce Type: new Abstract: Quantum machine learning models for sequential data face scalability challenges with complex multivariate signals. We introduce the Hybrid Quantum Temporal Convolutional Network (HQTCN), which combines classical temporal windowing with a quantum convolutional neural network core.…

March 2, 2026