arxiv:2505.10527
wang binghai
refrain-wbh
AI & ML interests
None yet
Recent Activity
commented on a paper 1 day ago
The Verification Horizon: No Silver Bullet for Coding Agent Rewards
upvoted a paper 1 day ago
The Verification Horizon: No Silver Bullet for Coding Agent Rewards
upvoted a paper 5 months ago
Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models
Organizations
None yet