Nemotron-Labs-Diffusion Collection A Tri-Mode Language Model Family Unifying Autoregressive, Diffusion, and Self-Speculation Decoding • 7 items • Updated 3 days ago • 48
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • 33B • Updated Feb 24, 2025 • 562k • • 1.56k
nm-testing/DeepSeek-R1-Distill-Qwen-32B-NVFP4 Text Generation • 19B • Updated Nov 21, 2025 • 1.25k • 3