Article Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents nvidia • Apr 28 • 62
MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos Paper • 2603.14145 • Published Mar 14 • 15
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16 Image-Text-to-Text • 13B • Updated Dec 2, 2025 • 147k • 84
Article Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub nvidia • Jun 27, 2025 • 31
Éclair -- Extracting Content and Layout with Integrated Reading Order for Documents Paper • 2502.04223 • Published Feb 6, 2025 • 10