haoyu wang

haoyuw

·

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

RUBRIC-ARROW: Alternating Pointwise Rubric Reward Modeling for LLM Post-training in Non-verifiable Domains

liked a dataset about 2 months ago

OpenRubrics/RubricARROW-Judge-SFT

upvoted a paper about 2 months ago

RUBRIC-ARROW: Alternating Pointwise Rubric Reward Modeling for LLM Post-training in Non-verifiable Domains

View all activity

Organizations

Papers 3

arxiv:2605.29156

arxiv:2602.01511

arxiv:2510.07743

models 3

haoyuw/Qwen2.5-1.5B-Math-Instruct-LIMO-Rewrite

Text Generation • 2B • Updated Mar 8, 2025 • 6

haoyuw/Qwen2.5-1.5B-Math-Instruct-LIMO

Text Generation • 2B • Updated Mar 8, 2025 • 8

haoyuw/Qwen2.5-1.5B-Instruct-LIMO

Text Generation • 2B • Updated Mar 8, 2025 • 4

datasets 5

haoyuw/cn_math_2024

Viewer • Updated Jun 30, 2025 • 30 • 16

haoyuw/aime

Viewer • Updated May 22, 2025 • 30 • 29

haoyuw/minerva

Viewer • Updated May 7, 2025 • 272 • 19

haoyuw/olympiad_bench

Viewer • Updated May 7, 2025 • 675 • 12

haoyuw/minervamath_latex

Viewer • Updated Mar 24, 2025 • 272 • 12