mradermacher/GCIRS-Reasoning-1.5B-R1-GGUF • Reinforcement Learning • 2B • Updated Jul 11, 2025 • 62 • 1