SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning Paper • 2603.23483 • Published 1 day ago • 41
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published 8 days ago • 242
laion/CLIP-ViT-L-14-laion2B-s32B-b82K Zero-Shot Image Classification • 0.4B • Updated Jan 16, 2024 • 369k • 62