Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Hu Yunhai's picture
1 2 3

Hu Yunhai

AlexCCtop

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning
upvoted a paper about 1 month ago
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning
upvoted a paper about 1 month ago
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs
View all activity

Organizations

None yet

upvoted 2 papers about 1 month ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published Jan 14 • 90

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published Jan 13 • 148
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs