- deepseek-ai/DeepSeek-R1
  Text Generation • 685B • Updated • 5.35M • 13.4k
- Congliu/Chinese-DeepSeek-R1-Distill-data-110k
  Viewer • Updated • 110k • 742 • 764
- The Ultra-Scale Playbook
  🌌 3.88k • The ultimate guide to training LLM on large GPU Clusters
- The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
  Paper • 2402.17764 • Published • 630
Sunny Ratnani
SunnyRatnaniMD
AI & ML interests
None yet
Organizations
Medical License Exam