https://github.com/dhcode-cpp/X-R1
xiaodongguaAIGC
AI & ML interests
RLHF
Organizations
None yet
models 10
xiaodongguaAIGC/Qwen3-moe-mini
Text Generation • 2B • Updated • 4
xiaodongguaAIGC/X-R1-3B-CN
Text Generation • 3B • Updated • 4 • 2
xiaodongguaAIGC/X-R1-3B
Text Generation • 3B • Updated • 5 • 2
xiaodongguaAIGC/X-R1-1.5B
Text Generation • 2B • Updated • 1
xiaodongguaAIGC/X-R1-0.5B
Text Generation • 0.5B • Updated • 1 • 1
xiaodongguaAIGC/xdg-math-step
Text Generation • 8B • Updated • 2 • 1
xiaodongguaAIGC/xdg-math-step-0118
Text Generation • 8B • Updated • 3
xiaodongguaAIGC/xdg-math-prm-lora
Updated • 1
xiaodongguaAIGC/xdg-llama-3-8B
Text Generation • 8B • Updated • 15 • 5
xiaodongguaAIGC/llama-3-debug
Text Generation • 16.5M • Updated • 48 • 2
datasets 16
xiaodongguaAIGC/X-R1-TAL-SCQ5K
Viewer • Updated • 10k • 18 • 3
xiaodongguaAIGC/X-R1-TAL-SCQ2K
Viewer • Updated • 3.33k • 71 • 1
xiaodongguaAIGC/X-R1-7500
Viewer • Updated • 12.5k • 18 • 2
xiaodongguaAIGC/X-R1-1500
Viewer • Updated • 2.5k • 16
xiaodongguaAIGC/X-R1-750
Viewer • Updated • 1.25k • 109 • 4
xiaodongguaAIGC/step_sft
Viewer • Updated • 84.2k • 121
xiaodongguaAIGC/step_prm
Viewer • Updated • 108k • 49
xiaodongguaAIGC/math_step_sft
Viewer • Updated • 12.5k • 10
xiaodongguaAIGC/GSM8k_step_sft
Viewer • Updated • 8.79k • 16
xiaodongguaAIGC/prm800k_step_sft
Viewer • Updated • 121k • 7