Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
5
Plyusov
daniilplyusov
Follow
kefirski's profile picture
1 follower
·
2 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
25 days ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation
upvoted
a
paper
29 days ago
F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare
upvoted
a
paper
7 months ago
Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success
View all activity
Organizations
None yet
daniilplyusov
's models
1
Sort: Recently updated
daniilplyusov/reward_model
Updated
Feb 2, 2025