Direct 3D-Aware Object Insertion via Decomposed Visual Proxies Paper β’ 2606.06601 β’ Published 17 days ago β’ 26
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture Paper β’ 2605.12500 β’ Published May 12 β’ 193
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper β’ 2603.03143 β’ Published Mar 3 β’ 145
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper β’ 2603.03143 β’ Published Mar 3 β’ 145
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding Paper β’ 2512.19693 β’ Published Dec 22, 2025 β’ 68
Runtime error Agents 69 Wan 2 2 First Last Frame π» 69 Generate a video by interpolating between two images with a text prompt