7

Plyusov

daniilplyusov

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Trust-Region Behavior Blending for On-Policy Distillation

upvoted a paper 3 months ago

Next Embedding Prediction Makes World Models Stronger

upvoted a paper 4 months ago

Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

Organizations

None yet

daniilplyusov's models (1)

daniilplyusov/reward_model

Updated Feb 2, 2025