FineInstructions Pretraining Corpora

Recent Activity

AjayP13  authored a paper 29 days ago
FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale
AjayP13  updated a dataset about 1 month ago
fineinstructions-pretraining/dclm_baseline_1.0_actual_all
AjayP13  updated a dataset about 1 month ago
fineinstructions-pretraining/fineweb_edu_actual_all

Team members: Ajay Patel, Colin Raffel

models 0

None public yet

datasets 24

fineinstructions-pretraining/dclm_baseline_1.0_actual_all

Viewer • Updated Jan 29 • 2.94B • 59 • 1

fineinstructions-pretraining/nemotron_fineinstructions_1T_exp_chat

Viewer • Updated Jan 29 • 721M • 58 • 2

fineinstructions-pretraining/nemotron_qa_1T_exp

Viewer • Updated Jan 29 • 425M • 118

fineinstructions-pretraining/ipt_fineinstructions_judged

Viewer • Updated Jan 29 • 763k • 18

fineinstructions-pretraining/ipt_fineinstructions_all_judged_exp_chat

Viewer • Updated Jan 29 • 34.5M • 42

fineinstructions-pretraining/ipt_synthetic_all_exp

Viewer • Updated Jan 29 • 26.6M • 15 • 1

fineinstructions-pretraining/fineweb_edu_actual_all

Preview • Updated Jan 29 • 301 • 1

fineinstructions-pretraining/nemotron_synthetic_1T_exp

Viewer • Updated Jan 29 • 448M • 52

fineinstructions-pretraining/nemotron_actual_1T_exp

Viewer • Updated Jan 29 • 302M • 33

fineinstructions-pretraining/nemotron_wrap_1T_exp

Viewer • Updated Jan 29 • 302M • 130