GRPO RL model
SunJack
SunJack
·
AI & ML interests
None yet
Organizations
models 14
SunJack/Qwen2.5-3B-R1-GGUF
3B • Updated
• 14
SunJack/Qwen2.5-3B-R1
Updated
SunJack/Phi-4-R1
Updated
SunJack/Phi-4-R1-GGUF
Updated
SunJack/Qwen2.5-7b-sft
Updated
SunJack/phi4-o1
15B • Updated
• 10
SunJack/Qwen2.5-3B-GRPO_lora
Updated
SunJack/qwen2.5-7b-o1
8B • Updated
• 5 • 1
SunJack/qwen2.5-7b-cve
8B • Updated
• 21 • 1
SunJack/qwen2-7b-ruozhiba-finetuning
8B • Updated
• 15 • 2