In a Training Loop 🔄

4 47 15

Yunzhuo Hao

luckychao

hychaochao

AI & ML interests

NLP

Recent Activity

upvoted a paper 10 days ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

upvoted a paper 19 days ago

GEMS: Agent-Native Multimodal Generation with Memory and Skills

upvoted a paper about 1 month ago

TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas

View all activity

Organizations

upvoted a paper 10 days ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published 11 days ago • 255

upvoted a paper 19 days ago

GEMS: Agent-Native Multimodal Generation with Memory and Skills

Paper • 2603.28088 • Published 21 days ago • 85

upvoted 3 papers about 1 month ago

TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas

Paper • 2603.16448 • Published Mar 17 • 58

OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data

Paper • 2603.15594 • Published Mar 16 • 149

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 424

upvoted an article about 1 month ago

Article

NEO-unify: Building Native Multimodal Unified Models End to End

Mar 5

•

125

upvoted 2 papers about 2 months ago

UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

Paper • 2603.03241 • Published Mar 3 • 87

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published Mar 3 • 103

upvoted 3 papers 2 months ago

upvoted a paper 3 months ago

AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning

Paper • 2601.18631 • Published Jan 26 • 48

upvoted a paper 4 months ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Paper • 2512.19673 • Published Dec 22, 2025 • 66

upvoted 4 papers 5 months ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Paper • 2511.13704 • Published Nov 17, 2025 • 44

VideoSSR: Video Self-Supervised Reinforcement Learning

Paper • 2511.06281 • Published Nov 9, 2025 • 25

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 242

upvoted 2 papers 6 months ago

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper • 2510.27492 • Published Oct 30, 2025 • 87

Spotlight on Token Perception for Multimodal Reinforcement Learning

Paper • 2510.09285 • Published Oct 10, 2025 • 37

upvoted a paper 7 months ago

Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Delibration

Paper • 2509.14760 • Published Sep 18, 2025 • 53

Yunzhuo Hao

AI & ML interests

Recent Activity

Organizations

luckychao's activity

NEO-unify: Building Native Multimodal Unified Models End to End