SmolLM 🤏 SmolLM models, datasets and demos HuggingFaceTB/SmolLM3-3B Text Generation • 3B • Updated Sep 10, 2025 • 1.1M • 926 HuggingFaceTB/SmolLM2-1.7B-Instruct Text Generation • Updated Apr 21, 2025 • 116k • 727 HuggingFaceTB/SmolVLM-Instruct Image-Text-to-Text • 2B • Updated Apr 8, 2025 • 29.2k • 583 HuggingFaceTB/SmolLM2-360M-Instruct Text Generation • Updated Sep 22, 2025 • 399k • 185
📚 Filtering the web with LLMs HuggingFaceFW/fineweb-edu Viewer • Updated Jul 11, 2025 • 3.5B • 313k • 1.01k HuggingFaceFW/fineweb-edu-classifier Text Classification • 0.1B • Updated Nov 17, 2024 • 7.74k • • 210 HuggingFaceFW/ablation-model-fineweb-edu Text Generation • 2B • Updated Jun 11, 2024 • 590 • 21 math-ai/AutoMathText Viewer • Updated Jul 16, 2025 • 7.89M • 11.9k • 185
HuggingFaceFW/fineweb-edu-classifier Text Classification • 0.1B • Updated Nov 17, 2024 • 7.74k • • 210
✨ Code Generation Code generation models and datassets! bigcode/starcoder2-15b Text Generation • Updated Jun 5, 2024 • 4.63k • 666 bigcode/the-stack Viewer • Updated Apr 13, 2023 • 546M • 14.3k • 973 bigcode/starcoder2-3b Text Generation • 3B • Updated Mar 4, 2024 • 101k • 216 bigcode/starcoder Text Generation • 16B • Updated Oct 8, 2024 • 10.8k • 2.94k
Instruct datasets QuixiAI/SystemChat-2.0 Viewer • Updated Jun 15, 2025 • 141k • 260 • 74 arcee-ai/infini-instruct-top-500k Viewer • Updated Jun 30, 2024 • 500k • 38 • 6 arcee-ai/The-Tome Viewer • Updated Aug 15, 2024 • 1.75M • 249 • 104 teknium/OpenHermes-2.5 Viewer • Updated Apr 15, 2024 • 1M • 21.9k • 806
🌌 Synthetic textbooks Synthetically generated textbooks HuggingFaceTB/cosmopedia Viewer • Updated Aug 12, 2024 • 31.1M • 12.6k • 681 Locutusque/UltraTextbooks Viewer • Updated Feb 2, 2024 • 5.52M • 551 • 198 microsoft/phi-2 Text Generation • 3B • Updated Dec 8, 2025 • 1.55M • 3.44k HuggingFaceTB/cosmo-1b Text Generation • 2B • Updated Jul 8, 2024 • 206 • 134
SmolLM 🤏 SmolLM models, datasets and demos HuggingFaceTB/SmolLM3-3B Text Generation • 3B • Updated Sep 10, 2025 • 1.1M • 926 HuggingFaceTB/SmolLM2-1.7B-Instruct Text Generation • Updated Apr 21, 2025 • 116k • 727 HuggingFaceTB/SmolVLM-Instruct Image-Text-to-Text • 2B • Updated Apr 8, 2025 • 29.2k • 583 HuggingFaceTB/SmolLM2-360M-Instruct Text Generation • Updated Sep 22, 2025 • 399k • 185
Instruct datasets QuixiAI/SystemChat-2.0 Viewer • Updated Jun 15, 2025 • 141k • 260 • 74 arcee-ai/infini-instruct-top-500k Viewer • Updated Jun 30, 2024 • 500k • 38 • 6 arcee-ai/The-Tome Viewer • Updated Aug 15, 2024 • 1.75M • 249 • 104 teknium/OpenHermes-2.5 Viewer • Updated Apr 15, 2024 • 1M • 21.9k • 806
📚 Filtering the web with LLMs HuggingFaceFW/fineweb-edu Viewer • Updated Jul 11, 2025 • 3.5B • 313k • 1.01k HuggingFaceFW/fineweb-edu-classifier Text Classification • 0.1B • Updated Nov 17, 2024 • 7.74k • • 210 HuggingFaceFW/ablation-model-fineweb-edu Text Generation • 2B • Updated Jun 11, 2024 • 590 • 21 math-ai/AutoMathText Viewer • Updated Jul 16, 2025 • 7.89M • 11.9k • 185
HuggingFaceFW/fineweb-edu-classifier Text Classification • 0.1B • Updated Nov 17, 2024 • 7.74k • • 210
🌌 Synthetic textbooks Synthetically generated textbooks HuggingFaceTB/cosmopedia Viewer • Updated Aug 12, 2024 • 31.1M • 12.6k • 681 Locutusque/UltraTextbooks Viewer • Updated Feb 2, 2024 • 5.52M • 551 • 198 microsoft/phi-2 Text Generation • 3B • Updated Dec 8, 2025 • 1.55M • 3.44k HuggingFaceTB/cosmo-1b Text Generation • 2B • Updated Jul 8, 2024 • 206 • 134
✨ Code Generation Code generation models and datassets! bigcode/starcoder2-15b Text Generation • Updated Jun 5, 2024 • 4.63k • 666 bigcode/the-stack Viewer • Updated Apr 13, 2023 • 546M • 14.3k • 973 bigcode/starcoder2-3b Text Generation • 3B • Updated Mar 4, 2024 • 101k • 216 bigcode/starcoder Text Generation • 16B • Updated Oct 8, 2024 • 10.8k • 2.94k