Empowering Multimodal Foundation Models with Generalized Visual Search
Kaican Li PRO
m-Just
AI & ML interests
None yet
Recent Activity
upvoted a paper about 9 hours ago
MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data upvoted a paper about 9 hours ago
AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios upvoted a paper about 9 hours ago
VLM-SubtleBench: How Far Are VLMs from Human-Level Subtle Comparative Reasoning? Organizations
None yet