A blog post by NVIDIA on Hugging Face
NVIDIA releases Nemotron-Labs-Diffusion, a tri-mode language model that unifies autoregressive (AR), diffusion, and self-speculation decoding at 5.99× tokens per forward pass.