Updated • 10.1k
• 196
Viewer
• Updated • 170M • 12.1k
• 97
Viewer
• Updated • 621M • 60.8k
• 88
Locutusque/UltraTextbooks
Viewer
• Updated • 5.52M • 2.36k
• 200
PrimeIntellect/StackV1-popular
Viewer
• Updated • 93M • 169
• 2
Viewer
• Updated • 11.7M • 441
• 8
EleutherAI/the_pile_deduplicated
Viewer
• Updated • 134M • 24.3k
• 114
HIT-TMG/KaLM-embedding-pretrain-data
Viewer
• Updated • 23.7M • 1.04k
• 22
suriyagunasekar/stackoverflow-with-meta-data
Viewer
• Updated • 19.9M • 568
• 12
Viewer
• Updated • 13.6M • 84
• 5
Viewer
• Updated • 3.71M • 1.31M
• 724
Viewer
• Updated • 474M • 182
• 4
EleutherAI/deep-ignorance-annealing-mix
Viewer
• Updated • 89M • 693
• 2
Viewer
• Updated • 10.2M • 319
• 5
Viewer
• Updated • 1.76M • 10.7k
• 407
Viewer
• Updated • 167M • 3.78k
• 74
Locutusque/deeplm-training-data
Viewer
• Updated • 2.17M • 102
• 3
nvidia/Llama-Nemotron-Post-Training-Dataset
Viewer
• Updated • 3.91M • 4.4k
• 677
Updated • 36.4k
• 262
EssentialAI/essential-web-v1.0
Preview
• Updated • 186k
• 226
Preview
• Updated • 276M • 123k
• 78