Archives AI News

Adaptive Divergence Regularized Policy Optimization for Fine-tuning Generative Models

arXiv:2510.18053v1 Announce Type: new Abstract: Balancing exploration and exploitation during reinforcement learning fine-tuning of generative models presents a critical challenge, as existing approaches rely on fixed divergence regularization that creates an inherent dilemma: strong regularization preserves model capabilities but limits…

Adapting Language Balance in Code-Switching Speech

arXiv:2510.18724v1 Announce Type: cross Abstract: Despite achieving impressive results on standard benchmarks, large foundational models still struggle against code-switching test cases. When data scarcity cannot be used as the usual justification for poor performance, the reason may lie in the…

SPACeR: Self-Play Anchoring with Centralized Reference Models

arXiv:2510.18060v1 Announce Type: new Abstract: Developing autonomous vehicles (AVs) requires not only safety and efficiency, but also realistic, human-like behaviors that are socially aware and predictable. Achieving this requires sim agent policies that are human-like, fast, and scalable in multi-agent…

Fine-tuning Flow Matching Generative Models with Intermediate Feedback

arXiv:2510.18072v1 Announce Type: new Abstract: Flow-based generative models have shown remarkable success in text-to-image generation, yet fine-tuning them with intermediate feedback remains challenging, especially for continuous-time flow matching models. Most existing approaches solely learn from outcome rewards, struggling with the…

Enabling Automatic Differentiation with Mollified Graph Neural Operators

arXiv:2504.08277v2 Announce Type: replace Abstract: Physics-informed neural operators offer a powerful framework for learning solution operators of partial differential equations (PDEs) by combining data and physics losses. However, these physics losses rely on derivatives. Computing these derivatives remains challenging, with…