Hao Li

Richardleee

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning

upvoted a paper 9 days ago

VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding

upvoted a paper about 1 month ago

OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories

Organizations

upvoted 2 papers 9 days ago

Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning

Paper • 2606.04923 • Published 11 days ago • 37

VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding

Paper • 2606.05259 • Published 11 days ago • 35

upvoted 3 papers about 1 month ago

OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories

Paper • 2605.04036 • Published May 5 • 69

ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration

Paper • 2605.03042 • Published May 4 • 131

Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems

Paper • 2605.04018 • Published May 5 • 40

updated a dataset 4 months ago

SWVR2/video_0042

Updated Feb 28 • 1.22k

published a dataset 4 months ago

SWVR2/video_0042

Updated Feb 28 • 1.22k

updated a dataset 4 months ago

SWVR2/video_0043

Updated Feb 28 • 323

published a dataset 4 months ago

SWVR2/video_0043

Updated Feb 28 • 323

updated 2 datasets 4 months ago

SWVR2/video_0035

Updated Feb 26 • 315

SWVR2/video_0044

Updated Feb 26 • 210

published a dataset 4 months ago

SWVR2/video_0044

Updated Feb 26 • 210

updated a dataset 4 months ago

SWVR2/video_0045

Updated Feb 25 • 308

upvoted a paper 4 months ago

SAGE: Benchmarking and Improving Retrieval for Deep Research Agents

Paper • 2602.05975 • Published Feb 5 • 12

upvoted a paper 10 months ago

CellForge: Agentic Design of Virtual Cell Models

Paper • 2508.02276 • Published Aug 4, 2025 • 40

upvoted a paper 11 months ago

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Paper • 2507.06229 • Published Jul 8, 2025 • 78

upvoted 2 papers 12 months ago

SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks

Paper • 2507.01001 • Published Jul 1, 2025 • 46

Can LLMs Generate High-Quality Test Cases for Algorithm Problems? TestCase-Eval: A Systematic Evaluation of Fault Coverage and Exposure

Paper • 2506.12278 • Published Jun 13, 2025 • 16

upvoted 2 papers about 1 year ago

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

Paper • 2505.22653 • Published May 28, 2025 • 43

ZeroGUI: Automating Online GUI Learning at Zero Human Cost

Paper • 2505.23762 • Published May 29, 2025 • 45
