AI & ML interests
LLM post-training
Organizations
ydeng9/OpenVLThinker-grpo-hard
• Updated • 6.25k • 28 • 1
ydeng9/OpenVLThinker-grpo-medium
• Updated • 3.3k • 15
• Updated • 960 • 5
• Updated • 2.3k • 10
• Updated • 82.8k • 13
• Updated • 1.76k • 8
• Updated • 1.32k • 10
• Updated • 789 • 8
• Updated • 6 • 12
ydeng9/swe-smith-rl-distill
• Updated • 7.81k • 15
ydeng9/OpenVLThinker-sft-iter3
• Updated • 3.28k • 23
ydeng9/OpenVLThinker_sft_iter2
• Updated • 5.54k • 6
ydeng9/captioned-data-subsetv1
• Updated • 59.3k • 19
• Updated • 3.11k • 98 • 1
• Updated • 5.87k • 288 • 1