Archives AI News

Muon+: Towards Better Muon via One Additional Normalization Step

arXiv:2602.21545v2 Announce Type: replace Abstract: The Muon optimizer has demonstrated promising performance in pre-training large language models through gradient (or momentum) orthogonalization. In this work, we propose a simple yet effective enhancement to Muon, namely Muon+, which introduces an additional…

Generative Value Conflicts Reveal LLM Priorities

arXiv:2509.25369v2 Announce Type: replace-cross Abstract: Past work seeks to align large language model (LLM)-based assistants with a target set of values, but such assistants are frequently forced to make tradeoffs between values when deployed. In response to the scarcity of…

BrepCoder: A Unified Multimodal Large Language Model for Multi-task B-rep Reasoning

arXiv:2602.22284v1 Announce Type: new Abstract: Recent advancements in deep learning have actively addressed complex challenges within the Computer-Aided Design (CAD) domain.However, most existing approaches rely on task-specifi c models requiring structural modifi cations for new tasks, and they predominantly focus…

Flow Matching is Adaptive to Manifold Structures

arXiv:2602.22486v1 Announce Type: cross Abstract: Flow matching has emerged as a simulation-free alternative to diffusion-based generative modeling, producing samples by solving an ODE whose time-dependent velocity field is learned along an interpolation between a simple source distribution (e.g., a standard…

Search-P1: Path-Centric Reward Shaping for Stable and Efficient Agentic RAG Training

arXiv:2602.22576v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) enhances large language models (LLMs) by incorporating external knowledge, yet traditional single-round retrieval struggles with complex multi-step reasoning. Agentic RAG addresses this by enabling LLMs to dynamically decide when and what to…

OmniZip: Learning a Unified and Lightweight Lossless Compressor for Multi-Modal Data

arXiv:2602.22286v1 Announce Type: new Abstract: Lossless compression is essential for efficient data storage and transmission. Although learning-based lossless compressors achieve strong results, most of them are designed for a single modality, leading to redundant compressor deployments in multi-modal settings. Designing…