datasets weaviate/agents Viewer • Updated Jun 11, 2025 • 22.7k • 296 • 13 supermemory/xAFS Updated 30 days ago • 6.75k • 8 gaia-benchmark/GAIA Viewer • Updated Oct 28, 2025 • 932 • 42.5k • 690 GloriaaaM/LLM-Agent-Harness-Survey Viewer • Updated May 14 • 1 • 1.37k • 7
LLM Evals cais/mmlu Viewer • Updated Mar 8, 2024 • 231k • 507k • 767 ZhuofengLi/web-bench Viewer • Updated Jan 19 • 3.94k • 270 OpenResearcher/web-bench Viewer • Updated 26 days ago • 5.5k • 3.12k • 4 blazeofchi/pdf-ocr-rl-dataset Viewer • Updated Mar 1 • 4.24k • 61 • 1
datasets weaviate/agents Viewer • Updated Jun 11, 2025 • 22.7k • 296 • 13 supermemory/xAFS Updated 30 days ago • 6.75k • 8 gaia-benchmark/GAIA Viewer • Updated Oct 28, 2025 • 932 • 42.5k • 690 GloriaaaM/LLM-Agent-Harness-Survey Viewer • Updated May 14 • 1 • 1.37k • 7
LLM Evals cais/mmlu Viewer • Updated Mar 8, 2024 • 231k • 507k • 767 ZhuofengLi/web-bench Viewer • Updated Jan 19 • 3.94k • 270 OpenResearcher/web-bench Viewer • Updated 26 days ago • 5.5k • 3.12k • 4 blazeofchi/pdf-ocr-rl-dataset Viewer • Updated Mar 1 • 4.24k • 61 • 1