DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing • Paper • 2603.28713 • Published 5 days ago • 17
STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding • Paper • 2603.27593 • Published 6 days ago • 10
MMFace-DiT: A Dual-Stream Diffusion Transformer for High-Fidelity Multimodal Face Generation • Paper • 2603.29029 • Published 5 days ago • 12
j05hr3d/Llama-3.2-3B-Instruct-C_M_T_CT_CE_CM-2EP-SEED999 • Text Generation • 3B • Updated 4 days ago • 322 • 1
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models • Paper • 2603.16859 • Published 18 days ago • 248