Math reasoning models distilled with Importance-Weighted On-Policy Distillation (IW-OPD).
Yan Xie
YannX
AI & ML interests
None yet
Recent Activity
upvoted a paper 2 days ago
On the Position Bias of On-Policy Distillation
updated a collection 25 days ago
IW-OPD-math
updated a collection 25 days ago
IW-OPD-math