Archives AI News

Integrating Arithmetic Learning Improves Mathematical Reasoning in Smaller Models

arXiv:2502.12855v2 Announce Type: replace-cross Abstract: While large models pre-trained on high-quality data exhibit excellent performance on mathematical reasoning (e.g., GSM8k, MultiArith), it remains challenging to specialize smaller models for these tasks. Common approaches to address this challenge include knowledge distillation…

Offline Reinforcement Learning via Inverse Optimization

arXiv:2502.20030v3 Announce Type: replace Abstract: Inspired by the recent successes of Inverse Optimization (IO) across various application domains, we propose a novel offline Reinforcement Learning (ORL) algorithm for continuous state and action spaces, leveraging the convex loss function called “sub-optimality…

Improving Epidemic Analyses with Privacy-Preserving Integration of Sensitive Data

arXiv:2506.22342v2 Announce Type: replace Abstract: Epidemic analyses increasingly rely on heterogeneous datasets, many of which are sensitive and require strong privacy protection. Although differential privacy (DP) has become a standard in machine learning and data sharing, its adoption in epidemiological…

Gaussian Process Limit Reveals Structural Benefits of Graph Transformers

arXiv:2603.17569v1 Announce Type: cross Abstract: Graph transformers are the state-of-the-art for learning from graph-structured data and are empirically known to avoid several pitfalls of message-passing architectures. However, there is limited theoretical analysis on why these models perform well in practice.…