GenCtrl -- A Formal Controllability Toolkit for Generative Models Paper • 2601.05637 • Published 3 days ago • 2
Klear: Unified Multi-Task Audio-Video Joint Generation Paper • 2601.04151 • Published 5 days ago • 13
google/gemma-3-12b-it-qat-q4_0-unquantized Image-Text-to-Text • 12B • Updated Apr 15, 2025 • 8.05k • 35
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation Paper • 2512.23576 • Published 14 days ago • 64
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation Paper • 2601.00664 • Published 10 days ago • 51
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos Paper • 2601.00393 • Published 11 days ago • 110
Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion Paper • 2512.23709 • Published 14 days ago • 48
Bridging Your Imagination with Audio-Video Generation via a Unified Director Paper • 2512.23222 • Published 14 days ago • 5