Updated
• 5.06k
• 196
Viewer
• Updated
• 170M • 192k
• 90
Viewer
• Updated
• 621M • 10.8k
• 87
Locutusque/UltraTextbooks
Viewer
• Updated
• 5.52M • 1.72k
• 198
PrimeIntellect/StackV1-popular
Viewer
• Updated
• 93M • 930
• 2
Viewer
• Updated
• 11.7M • 131
• 5
EleutherAI/the_pile_deduplicated
Viewer
• Updated
• 134M • 11k
• 108
HIT-TMG/KaLM-embedding-pretrain-data
Viewer
• Updated
• 23.7M • 1.35k
• 20
suriyagunasekar/stackoverflow-with-meta-data
Viewer
• Updated
• 19.9M • 169
• 12
Viewer
• Updated
• 13.6M • 1.81k
• 5
Viewer
• Updated
• 3.71M • 927k
• 640
Viewer
• Updated
• 474M • 48
• 4
EleutherAI/deep-ignorance-annealing-mix
Viewer
• Updated
• 89M • 68
• 1
Viewer
• Updated
• 10.2M • 36
• 5
Viewer
• Updated
• 1.76M • 8.01k
• 401
Viewer
• Updated
• 167M • 3.63k
• 66
Locutusque/deeplm-training-data
Viewer
• Updated
• 2.17M • 77
• 3
nvidia/Llama-Nemotron-Post-Training-Dataset
Viewer
• Updated
• 3.91M • 2.9k
• 644
Updated
• 17.5k
• 248
EssentialAI/essential-web-v1.0
Preview
• Updated
• 90.3k
• 218