1 104 2

hangyu guo

Rosiness

AI & ML interests

Natural Language Processing

Recent Activity

upvoted a paper 13 days ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

upvoted a paper 14 days ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

upvoted a paper 20 days ago

Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents

View all activity

Organizations

upvoted a paper 13 days ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Paper • 2605.20025 • Published 14 days ago • 185

upvoted a paper 14 days ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published 20 days ago • 159

upvoted a paper 20 days ago

Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents

Paper • 2605.10832 • Published 22 days ago • 21

upvoted a paper about 1 month ago

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published Apr 20 • 85

updated a dataset about 1 month ago

MM-R1-HH/envs_supply

Preview • Updated Apr 20 • 67

published a dataset about 1 month ago

MM-R1-HH/envs_supply

Preview • Updated Apr 20 • 67

upvoted 4 papers about 2 months ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published Apr 15 • 164

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

Paper • 2604.10866 • Published Apr 13 • 66

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Paper • 2604.07429 • Published Apr 8 • 121

Towards Long-horizon Agentic Multimodal Search

Paper • 2604.12890 • Published Apr 14 • 20

upvoted 4 papers 2 months ago

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Paper • 2603.25158 • Published Mar 26 • 53

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Paper • 2603.22117 • Published Mar 23 • 29

WorldCache: Content-Aware Caching for Accelerated Video World Models

Paper • 2603.22286 • Published Mar 23 • 5

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Paper • 2603.17024 • Published Mar 17 • 110

upvoted 2 papers 3 months ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 140

XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Paper • 2603.12056 • Published Mar 12 • 34

authored a paper 3 months ago

WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics

Paper • 2603.13391 • Published Mar 11 • 20

upvoted 3 papers 3 months ago

\$OneMillion-Bench: How Far are Language Agents from Human Experts?

Paper • 2603.07980 • Published Mar 9 • 27

Proact-VL: A Proactive VideoLLM for Real-Time AI Companions

Paper • 2603.03447 • Published Mar 3 • 37

AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

Paper • 2602.23166 • Published Feb 26 • 45

hangyu guo

AI & ML interests

Recent Activity

Organizations

Rosiness's activity