BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining Paper • 2508.10975 • Published Aug 14, 2025 • 60
qfq/genminiall_hardfiltered_onlyqwenwrong_aimegpqatrain_powerlaw_nostepsnoanswer Viewer • Updated Jan 14, 2025 • 1k • 5 • 1