Flash-WAM: Modality-Aware Distillation for World Action Models Paper • 2606.05254 • Published 4 days ago • 4
LongLive-RAG: A General Retrieval-Augmented Framework for Long Video Generation Paper • 2606.02553 • Published 6 days ago • 19
NVIDIA OmniDreams: Real-Time Generative World Model for Closed-Loop Autonomous Vehicle Simulation Paper • 2606.03159 • Published 5 days ago • 21
Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer Paper • 2605.30940 • Published 9 days ago • 37
YoCausal: How Far is Video Generation from World Model? A Causality Perspective Paper • 2605.30346 • Published 10 days ago • 54
minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models Paper • 2605.30263 • Published 10 days ago • 57
OSP-Next: Efficient High-Quality Video Generation with Sparse Sequence Parallelism, HiF8 Quantization, and Reinforcement Learning Paper • 2605.28691 • Published 11 days ago • 24
JLT: Clean-Latent Prediction in Latent Diffusion Transformers Paper • 2605.27102 • Published 12 days ago • 32
GEN3C Collection 3D-Informed World-Consistent Video Generation with Precise Camera Control • 3 items • Updated 3 days ago • 9
Geo-Align: Video Generation Alignment via Metric Geometry Reward Paper • 2605.23903 • Published 16 days ago • 10
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 18 days ago • 204
Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling Paper • 2605.13301 • Published 25 days ago • 159
FlowLong: Inference-time Long Video Generation via Manifold-constrained Tweedie Matching Paper • 2605.20910 • Published 18 days ago • 29
UniT: Unified Geometry Learning with Group Autoregressive Transformer Paper • 2605.21131 • Published 18 days ago • 8