Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
yu's picture
3 16 11

yu

yqi19
IranQin's profile picture thyjim's profile picture Tmzzzy's profile picture
·
https://github.com/yqi19
  • yqi19

AI & ML interests

Enthusiastic Music Lover🎵🎻

Recent Activity

new activity about 12 hours ago
HuggingFriends/mllm-as-embodied-world-judge:Add Cosmos3-Nano image2video generations (cosmos3_prefix + cosmos3_rewrite)
updated a dataset about 12 hours ago
yqi19/cosmos_generated_videos
published a dataset about 13 hours ago
yqi19/cosmos_generated_videos
View all activity

Organizations

None yet

authored 4 papers 8 months ago

Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark

Paper • 2510.26802 • Published Oct 30, 2025 • 34

ThinkGrasp: A Vision-Language System for Strategic Part Grasping in Clutter

Paper • 2407.11298 • Published Jul 16, 2024 • 6

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

Paper • 2502.09621 • Published Feb 13, 2025 • 28

BEAR: Benchmarking and Enhancing Multimodal Language Models for Atomic Embodied Capabilities

Paper • 2510.08759 • Published Oct 9, 2025 • 46
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs