Diversity-Incentivized Exploration for Versatile Reasoning
Zican Hu
huzican
AI & ML interests
None yet
Recent Activity
upvoted a paper 19 days ago
Uni-OPD: Unifying On-Policy Distillation with a Dual-Perspective Recipe upvoted a paper about 1 month ago
Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning updated a model 2 months ago
huzican/unify_sft_tit-ckpt-3000