sungyub kim

sungyub

AI & ML interests

None yet

Recent Activity

upvoted an article 16 days ago

We Got Claude to Build CUDA Kernels and teach open models!

upvoted an article 16 days ago

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

updated a collection about 1 month ago

VERL QA Datasets

Organizations

None yet

upvoted 2 articles 16 days ago

Article

We Got Claude to Build CUDA Kernels and teach open models!

17 days ago • 138

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

18 days ago

updated a collection about 1 month ago

VERL QA Datasets

Collection

High-quality QA generation datasets in VERL format: document QA, table reasoning, and multi-hop reasoning tasks. • 7 items • Updated Jan 8

updated a dataset about 1 month ago

sungyub/qa-verl-unified

Viewer • Updated Jan 8 • 86.4k • 104

published a dataset about 1 month ago

sungyub/qa-verl-unified

Viewer • Updated Jan 8 • 86.4k • 104

updated 2 datasets about 1 month ago

sungyub/docqa-rl-verl

Viewer • Updated Jan 8 • 3.6k • 46

sungyub/code-verl-unified

Viewer • Updated Jan 8 • 959k • 155 • 1

liked 2 Spaces 2 months ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.29k

Download a trillion‑token web text dataset for LLM training

Evaluation Guidebook

📝

269

Explore LLM benchmark trends over time

updated a dataset 3 months ago

sungyub/codev-r1-verl

Viewer • Updated Nov 11, 2025 • 3.13k • 19

upvoted an article 3 months ago

Article

Let's talk about LLM evaluation

May 23, 2024 • 206

liked 2 Spaces 3 months ago

The Ultra-Scale Playbook

🌌

3.69k

The ultimate guide to training LLM on large GPU Clusters

The Smol Training Playbook

📚

2.98k

The secrets to building world-class LLMs

updated 7 datasets 3 months ago
