World Pilot: Steering Vision-Language-Action Models with World-Action Priors Paper • 2606.12403 • Published 3 days ago • 23
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors Paper • 2605.00658 • Published May 1 • 84