xiao45791/Qwen3-VL-8B-Instruct-SFT-Gemini-Distill-100k Image-Text-to-Text • 9B • Updated 7 days ago • 39
xiao45791/Qwen3-VL-8B-Instruct-SFT-Gemini-Distill-100k Image-Text-to-Text • 9B • Updated 7 days ago • 39
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-dapo-1144steps 5B • Updated 21 days ago • 54
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-dapo-1144steps 5B • Updated 21 days ago • 54
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-grpo-1320steps 5B • Updated 21 days ago • 51
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-grpo-1320steps 5B • Updated 21 days ago • 51
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-grpo-1500steps 5B • Updated 21 days ago • 63
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-grpo-1500steps 5B • Updated 21 days ago • 63
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-DAPO-720steps 5B • Updated 23 days ago • 57
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-DAPO-720steps 5B • Updated 23 days ago • 57
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-DAPO-960steps 5B • Updated 24 days ago • 66
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-DAPO-960steps 5B • Updated 24 days ago • 66
MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification Paper • 2603.15726 • Published Mar 16 • 185