Running on CPU Upgrade Agents 606 GAIA Leaderboard π¦Ύ 606 Submit and score your model on the GAIA benchmark
Running on CPU Upgrade 198 LLM Hallucination Leaderboard π 198 View and filter LLM hallucination leaderboard
Jofthomas/hermes-function-calling-thinking-V1 Viewer β’ Updated Feb 16, 2025 β’ 3.57k β’ 635 β’ 78
Running 3.86k The Ultra-Scale Playbook π 3.86k The ultimate guide to training LLM on large GPU Clusters
Running Agents 80 AI Energy Score Leaderboard π 80 Explore AI energy efficiency across various tasks
meta-llama/Llama-3.3-70B-Instruct Text Generation β’ 71B β’ Updated Dec 21, 2024 β’ 951k β’ β’ 2.78k
Running Agents 111 Judge Arena π» 111 View and compare openβsource AI model rankings with ELO scores
Running Featured 598 Image Arena Leaderboard π 598 Image Generation and Image Editing Arena & Leaderboard
meta-llama/Llama-3.1-8B-Instruct Text Generation β’ 8B β’ Updated Sep 25, 2024 β’ 10.6M β’ β’ 5.89k
meta-llama/Llama-3.1-405B-Instruct Text Generation β’ 406B β’ Updated Sep 25, 2024 β’ 228k β’ 594
meta-llama/Llama-3.1-70B-Instruct Text Generation β’ 71B β’ Updated Dec 15, 2024 β’ 757k β’ β’ 916