arxiv:2505.10527
wang binghai
refrain-wbh
AI & ML interests
None yet
Recent Activity
commentedon a paper about 9 hours ago
The Verification Horizon: No Silver Bullet for Coding Agent Rewards upvoted a paper about 20 hours ago
The Verification Horizon: No Silver Bullet for Coding Agent Rewards upvoted a paper 5 months ago
Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward ModelsOrganizations
None yet