yu

yqi19

https://github.com/yqi19

AI & ML interests

Enthusiastic Music Lover🎵🎻

Recent Activity

new activity about 12 hours ago

HuggingFriends/mllm-as-embodied-world-judge: Add Cosmos3-Nano image2video generations (cosmos3_prefix + cosmos3_rewrite)

updated a dataset about 12 hours ago

yqi19/cosmos_generated_videos

published a dataset about 13 hours ago

yqi19/cosmos_generated_videos

Organizations

None yet

authored 4 papers 8 months ago

Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark

Paper • 2510.26802 • Published Oct 30, 2025 • 34

ThinkGrasp: A Vision-Language System for Strategic Part Grasping in Clutter

Paper • 2407.11298 • Published Jul 16, 2024 • 6

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

Paper • 2502.09621 • Published Feb 13, 2025 • 28

BEAR: Benchmarking and Enhancing Multimodal Language Models for Atomic Embodied Capabilities

Paper • 2510.08759 • Published Oct 9, 2025 • 46