dUltra: Ultra-Fast Diffusion Language Models via Reinforcement Learning
arXiv:2512.21446v1 Announce Type: new Abstract: Masked diffusion language models (MDLMs) offer the potential for parallel token generation, but most open-source MDLMs decode fewer than 5 tokens per model forward pass even with sophisticated sampling strategies. As a result, their sampling…
