Fine Tuning Datasets ethanolivertroy/nist-cybersecurity-training Viewer • Updated Oct 22, 2025 • 531k • 1.26k • 52 darkknight25/Vulnerable_Programming_Dataset Updated May 24, 2025 • 65 • 1 WNT3D/Ultimate-Offensive-Red-Team Viewer • Updated Aug 23, 2025 • 25.6k • 547 • 146 yevzh1/Ultimate-Offensive-Red-Team Viewer • Updated Jan 4 • 25.6k • 74
Bench Datasets Idavidrein/gpqa Benchmark • Updated Mar 5 • 1.25k • 116k • 458 openai/gsm8k Benchmark • Updated Mar 23 • 17.6k • 913k • 1.38k princeton-nlp/SWE-bench_Verified Viewer • Updated Feb 18, 2025 • 500 • 924k • 353 ScaleAI/SWE-bench_Pro Benchmark • Updated Feb 23 • 731 • 70.1k • 124
Gym ServiceNow-AI/EnterpriseOps-Gym Viewer • Updated Apr 30 • 2.56k • 7.52k • 89 allenai/MolmoWeb-HumanSkills Viewer • Updated Apr 13 • 116k • 1.5k • 14 allenai/MolmoWeb-SyntheticSkills Viewer • Updated Apr 13 • 5.55k • 282 • 7 allenai/MolmoWeb-SyntheticTrajs Viewer • Updated Apr 10 • 108k • 1.39k • 10
Fine Tuning Datasets ethanolivertroy/nist-cybersecurity-training Viewer • Updated Oct 22, 2025 • 531k • 1.26k • 52 darkknight25/Vulnerable_Programming_Dataset Updated May 24, 2025 • 65 • 1 WNT3D/Ultimate-Offensive-Red-Team Viewer • Updated Aug 23, 2025 • 25.6k • 547 • 146 yevzh1/Ultimate-Offensive-Red-Team Viewer • Updated Jan 4 • 25.6k • 74
Gym ServiceNow-AI/EnterpriseOps-Gym Viewer • Updated Apr 30 • 2.56k • 7.52k • 89 allenai/MolmoWeb-HumanSkills Viewer • Updated Apr 13 • 116k • 1.5k • 14 allenai/MolmoWeb-SyntheticSkills Viewer • Updated Apr 13 • 5.55k • 282 • 7 allenai/MolmoWeb-SyntheticTrajs Viewer • Updated Apr 10 • 108k • 1.39k • 10
Bench Datasets Idavidrein/gpqa Benchmark • Updated Mar 5 • 1.25k • 116k • 458 openai/gsm8k Benchmark • Updated Mar 23 • 17.6k • 913k • 1.38k princeton-nlp/SWE-bench_Verified Viewer • Updated Feb 18, 2025 • 500 • 924k • 353 ScaleAI/SWE-bench_Pro Benchmark • Updated Feb 23 • 731 • 70.1k • 124