Archives AI News

Sample Size Calculations for Developing Clinical Prediction Models: Overview and pmsims R package

arXiv:2602.23507v1 Announce Type: new Abstract: Background: Clinical prediction models are increasingly used to inform healthcare decisions, but determining the minimum sample size for their development remains a critical and unresolved challenge. Inadequate sample sizes can lead to overfitting, poor generalisability,…

March 2, 2026

InfoBridge: Mutual Information estimation via Bridge Matching

arXiv:2502.01383v4 Announce Type: replace Abstract: Diffusion bridge models have recently become a powerful tool in the field of generative modeling. In this work, we leverage their power to address another important problem in machine learning and information theory, the estimation…

March 2, 2026

Neural Operators Can Discover Functional Clusters

arXiv:2602.23528v1 Announce Type: new Abstract: Operator learning is reshaping scientific computing by amortizing inference across infinite families of problems. While neural operators (NOs) are increasingly well understood for regression, far less is known for classification and its unsupervised analogue: clustering.…

March 2, 2026

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

arXiv:2508.05629v3 Announce Type: replace Abstract: In this work, we present a simple yet theoretically motivated improvement to Supervised Fine-Tuning (SFT) for the Large Language Model (LLM), addressing its limited generalization compared to reinforcement learning (RL). Through mathematical analysis, we reveal…

March 2, 2026

Active Value Querying to Minimize Additive Error in Subadditive Set Function Learning

arXiv:2602.23529v1 Announce Type: new Abstract: Subadditive set functions play a pivotal role in computational economics (especially in combinatorial auctions), combinatorial optimization or artificial intelligence applications such as interpretable machine learning. However, specifying a set function requires assigning values to an…

March 2, 2026

The False Promise of Zero-Shot Super-Resolution in Machine-Learned Operators

arXiv:2510.06646v2 Announce Type: replace Abstract: A core challenge in scientific machine learning, and scientific computing more generally, is modeling continuous phenomena which (in practice) are represented discretely. Machine-learned operators (MLOs) have been introduced as a means to achieve this modeling…

March 2, 2026

Rudder: Steering Prefetching in Distributed GNN Training using LLM Agents

arXiv:2602.23556v1 Announce Type: new Abstract: Large-scale Graph Neural Networks (GNNs) are typically trained by sampling a vertex’s neighbors to a fixed distance. Because large input graphs are distributed, training requires frequent irregular communication that stalls forward progress. Moreover, fetched data…

March 2, 2026

Convex Loss Functions for Support Vector Machines (SVMs) and Neural Networks

arXiv:2601.21331v4 Announce Type: replace Abstract: We propose a new convex loss for Support Vector Machines, both for the binary classification and for the regression models. Therefore, we show the mathematical derivation of the dual problems and we experiment with them…

March 2, 2026

Dynamics of Learning under User Choice: Overspecialization and Peer-Model Probing

arXiv:2602.23565v1 Announce Type: new Abstract: In many economically relevant contexts where machine learning is deployed, multiple platforms obtain data from the same pool of users, each of whom selects the platform that best serves them. Prior work in this setting…

March 2, 2026

Test-Time Training with KV Binding Is Secretly Linear Attention

arXiv:2602.21204v2 Announce Type: replace Abstract: Test-time training (TTT) with KV binding as sequence modeling layer is commonly interpreted as a form of online meta-learning that memorizes a key-value mapping at test time. However, our analysis reveals multiple phenomena that contradict…

March 2, 2026