Avatar V: Scaling Video-Reference Avatar Video Generation Paper • 2606.13872 • Published 11 days ago • 9
OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data Paper • 2606.13432 • Published 11 days ago • 105
Echo-Memory: A Controlled Study of Memory in Action World Models Paper • 2606.09803 • Published 14 days ago • 32
AAD-1: Asymmetric Adversarial Distillation for One-Step Autoregressive Video Generation Paper • 2606.03972 • Published 20 days ago • 14
Echo-Infinity: Learning Evolving Memory for Real-Time Infinite Video Generation Paper • 2606.04527 • Published 19 days ago • 28
StreamChar: Long-Horizon Streaming Character Audio-Video Generation with Decoupled Orchestration Paper • 2605.25659 • Published 28 days ago • 16
VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion Paper • 2605.30351 • Published 25 days ago • 26
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published 27 days ago • 144
WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation Paper • 2605.25874 • Published 28 days ago • 103
LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation Paper • 2605.18739 • Published May 18 • 115
WorldMark: A Unified Benchmark Suite for Interactive Video World Models Paper • 2604.21686 • Published Apr 23 • 36