VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding
Paper • 2603.22285 • Published • 49
None defined yet.
VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding
Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion