Archives AI News

Exploring Pass-Rate Reward in Reinforcement Learning for Code Generation

arXiv:2605.02944v1 Announce Type: new Abstract: Reinforcement learning (RL) from unit-test feedback has become a standard post-training recipe for improving large language models (LLMs) on code generation. However, the pass-all-tests binary reward can be sparse, yielding no learning signal on challenging…

RouteHijack: Routing-Aware Attack on Mixture-of-Experts LLMs

arXiv:2605.02946v1 Announce Type: new Abstract: Safety alignment is critical for the responsible deployment of large language models (LLMs). As Mixture-of-Experts (MoE) architectures are increasingly adopted to scale model capacity, understanding their safety robustness becomes essential. Existing adversarial attacks, however, have…

Fisher Decorator: Refining Flow Policy via a Local Transport Map

arXiv:2604.17919v2 Announce Type: replace Abstract: Recent advances in flow-based offline reinforcement learning (RL) have achieved strong performance by parameterizing policies via flow matching. However, they still face critical trade-offs among expressiveness, optimality, and efficiency. In particular, existing flow policies interpret…

InvisibleInk: High-Utility and Low-Cost Text Generation with Differential Privacy

arXiv:2507.02974v3 Announce Type: replace Abstract: As major progress in LLM-based long-form text generation enables paradigms such as retrieval-augmented generation (RAG) and inference-time scaling, safely incorporating private information into the generation remains a critical open question. We present InvisibleInk, a highly…