Open to Work

1 11 18

Yiming Zhao

gaotiexinqu

gaotiexinqu

AI & ML interests

VLMs, Agent, RL, Reasoning

Recent Activity

authored a paper 2 days ago

VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation

authored a paper 2 days ago

SaaSBench: Exploring the Boundaries of Coding Agents in Long-Horizon Enterprise SaaS Engineering

upvoted a paper 2 days ago

SaaSBench: Exploring the Boundaries of Coding Agents in Long-Horizon Enterprise SaaS Engineering

View all activity

Organizations

authored 2 papers 2 days ago

VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation

Paper • 2605.16079 • Published 8 days ago • 25

SaaSBench: Exploring the Boundaries of Coding Agents in Long-Horizon Enterprise SaaS Engineering

Paper • 2605.17526 • Published 6 days ago • 5

upvoted a paper 2 days ago

SaaSBench: Exploring the Boundaries of Coding Agents in Long-Horizon Enterprise SaaS Engineering

Paper • 2605.17526 • Published 6 days ago • 5

upvoted a paper 3 days ago

VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation

Paper • 2605.16079 • Published 8 days ago • 25

submitted a paper to Daily Papers 3 days ago

VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation

Paper • 2605.16079 • Published 8 days ago • 25

upvoted a paper 8 days ago

OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation

Paper • 2605.12480 • Published 11 days ago • 4

authored 2 papers 9 days ago

Flow-OPD: On-Policy Distillation for Flow Matching Models

Paper • 2605.08063 • Published 15 days ago • 97

SCOPE: Structured Decomposition and Conditional Skill Orchestration for Complex Image Generation

Paper • 2605.08043 • Published 15 days ago • 10

upvoted 2 papers 11 days ago

SCOPE: Structured Decomposition and Conditional Skill Orchestration for Complex Image Generation

Paper • 2605.08043 • Published 15 days ago • 10

Flow-OPD: On-Policy Distillation for Flow Matching Models

Paper • 2605.08063 • Published 15 days ago • 97

authored a paper about 1 month ago

SkillFlow:Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents

Paper • 2604.17308 • Published Apr 19 • 22

liked a dataset about 1 month ago

zhang-ziao/SkillFlow-Task

Updated Apr 21 • 1.96k • 4

upvoted a paper about 1 month ago

SkillFlow:Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents

Paper • 2604.17308 • Published Apr 19 • 22

liked a Space 3 months ago

Image Arena Leaderboard

📊

598

Image Generation and Image Editing Arena & Leaderboard

updated a dataset 4 months ago

gaotiexinqu/V2P-Bench

Viewer • Updated Feb 5 • 1.17k • 55 • 2

upvoted a paper 4 months ago

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published Jan 29 • 155

authored 4 papers 4 months ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Paper • 2504.07956 • Published Apr 10, 2025 • 46

V2P-Bench: Evaluating Video-Language Understanding with Visual Prompts for Better Human-Model Interaction

Paper • 2503.17736 • Published Mar 22, 2025 • 3

Agentic Jigsaw Interaction Learning for Enhancing Visual Perception and Reasoning in Vision-Language Models

Paper • 2510.01304 • Published Oct 1, 2025 • 11

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Paper • 2602.02185 • Published Feb 2 • 118

Yiming Zhao

AI & ML interests

Recent Activity

Organizations

gaotiexinqu's activity

Image Arena Leaderboard