Plyusov
daniilplyusov
AI & ML interests
None yet
Recent Activity
upvoted a paper 3 days ago
Trust-Region Behavior Blending for On-Policy Distillation upvoted a paper 3 months ago
Next Embedding Prediction Makes World Models Stronger upvoted a paper 4 months ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward ExtrapolationOrganizations
None yet