Archives AI News

Learning to Ball: Composing Policies for Long-Horizon Basketball Moves

arXiv:2509.22442v1 Announce Type: cross Abstract: Learning a control policy for a multi-phase, long-horizon task, such as basketball maneuvers, remains challenging for reinforcement learning approaches due to the need for seamless policy composition and transitions between skills. A long-horizon task typically…

d2: Improved Techniques for Training Reasoning Diffusion Language Models

arXiv:2509.21474v1 Announce Type: new Abstract: While diffusion language models (DLMs) have achieved competitive performance in text generation, improving their reasoning ability with reinforcement learning remains an active research area. Here, we introduce d2, a reasoning framework tailored for masked DLMs.…

Filtering with Confidence: When Data Augmentation Meets Conformal Prediction

arXiv:2509.21479v1 Announce Type: new Abstract: With promising empirical performance across a wide range of applications, synthetic data augmentation appears a viable solution to data scarcity and the demands of increasingly data-intensive models. Its effectiveness lies in expanding the training set…

WeightLoRA: Keep Only Necessary Adapters

arXiv:2506.02724v2 Announce Type: replace Abstract: The widespread utilization of language models in modern applications is inconceivable without Parameter-Efficient Fine-Tuning techniques, such as low-rank adaptation ($texttt{LoRA}$), which adds trainable adapters to selected layers. Although $texttt{LoRA}$ may obtain accurate solutions, it requires…

High-Probability Analysis of Online and Federated Zero-Order Optimisation

arXiv:2509.21484v1 Announce Type: new Abstract: We study distributed learning in the setting of gradient-free zero-order optimization and introduce FedZero, a federated zero-order algorithm that delivers sharp theoretical guarantees. Specifically, FedZero: (1) achieves near-optimal optimization error bounds with high probability in…

AI for Scientific Discovery is a Social Problem

arXiv:2509.06580v3 Announce Type: replace Abstract: Artificial intelligence promises to accelerate scientific discovery, yet its benefits remain unevenly distributed. While technical obstacles such as scarce data, fragmented standards, and unequal access to computation are significant, we argue that the primary barriers…