Archives AI News

Exploring Pass-Rate Reward in Reinforcement Learning for Code Generation

arXiv:2605.02944v1 Announce Type: new Abstract: Reinforcement learning (RL) from unit-test feedback has become a standard post-training recipe for improving large language models (LLMs) on code generation. However, the pass-all-tests binary reward can be sparse, yielding no learning signal on challenging…

RouteHijack: Routing-Aware Attack on Mixture-of-Experts LLMs

arXiv:2605.02946v1 Announce Type: new Abstract: Safety alignment is critical for the responsible deployment of large language models (LLMs). As Mixture-of-Experts (MoE) architectures are increasingly adopted to scale model capacity, understanding their safety robustness becomes essential. Existing adversarial attacks, however, have…

Parametrizing Convex Sets Using Sublinear Neural Networks

arXiv:2605.03520v1 Announce Type: cross Abstract: We propose a neural parameterization of convex sets by learning sublinear (positively homogeneous and convex) functions. Our networks implicitly represent both the support and gauge functions of a convex body. We prove a universal approximation…

Exact and Approximate Algorithms for Polytree Learning

arXiv:2605.03622v1 Announce Type: cross Abstract: Polytrees are a subclass of Bayesian networks that seek to capture the conditional dependencies between a set of $n$ variables as a directed forest and are motivated by their more efficient inference and improved interpretability.…

Disease Is a Spectral Perturbation

arXiv:2605.02949v1 Announce Type: new Abstract: We propose a novel method of understanding disease transformation from a healthy baseline with biomarker-level explainability. By modeling the biomarker covariance matrices of healthy controls and disease states, the perturbation can be individually characterized to…

Training-Free Probabilistic Time-Series Forecasting with Conformal Seasonal Pools

arXiv:2605.03789v1 Announce Type: cross Abstract: We propose Conformal Seasonal Pools (CSP), a training-free probabilistic time-series forecaster that mixes same-season empirical draws with signed residual draws around a seasonal naive forecast. In an audited rolling-origin benchmark on the six time-series datasets…