unsloth/DeepSeek-R1-Distill-Llama-70B-GGUF Text Generation • 71B • Updated May 10, 2025 • 37.3k • 100
Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models Paper • 2409.09345 • Published Sep 14, 2024 • 1
Running on CPU Upgrade 13.8k Open LLM Leaderboard 🏆 13.8k Track, rank and evaluate open LLMs and chatbots
TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T Text Generation • 1B • Updated Sep 27, 2024 • 30.2k • • 184