Long Reasoning Datasets with reasoning traces for math and code (Train + Eval) HuggingFaceH4/MATH Viewer • Updated Jan 28, 2025 • 13.8k • 672 • 8 HuggingFaceH4/MATH-500 Viewer • Updated Dec 15, 2025 • 500 • 116k • 288 microsoft/orca-math-word-problems-200k Viewer • Updated Mar 4, 2024 • 200k • 5.61k • 476 openai/gsm8k Benchmark • Updated Dec 20, 2025 • 17.6k • 644k • 1.2k
Long Reasoning Datasets with reasoning traces for math and code (Train + Eval) HuggingFaceH4/MATH Viewer • Updated Jan 28, 2025 • 13.8k • 672 • 8 HuggingFaceH4/MATH-500 Viewer • Updated Dec 15, 2025 • 500 • 116k • 288 microsoft/orca-math-word-problems-200k Viewer • Updated Mar 4, 2024 • 200k • 5.61k • 476 openai/gsm8k Benchmark • Updated Dec 20, 2025 • 17.6k • 644k • 1.2k