17 113

z

Huye2023

AI & ML interests

None yet

Recent Activity

upvoted a collection about 2 months ago

Nemotron-Pre-Training-Datasets

liked a dataset about 2 months ago

nvidia/Nemotron-Pretraining-Dataset-sample

liked a model 4 months ago

Qwen/Qwen3-Next-80B-A3B-Instruct

View all activity

Organizations

None yet

upvoted a collection about 2 months ago

Nemotron-Pre-Training-Datasets

Collection

Large scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated 6 days ago • 164

liked a dataset about 2 months ago

nvidia/Nemotron-Pretraining-Dataset-sample

Viewer • Updated Dec 22, 2025 • 27.7k • 1.24k • 64

liked a model 4 months ago

Qwen/Qwen3-Next-80B-A3B-Instruct

Text Generation • 81B • Updated Sep 17, 2025 • 231k • • 1.03k

upvoted a collection 4 months ago

Qwen3.5

Collection

21 items • Updated Mar 9 • 1.68k

liked a model 5 months ago

Qwen/Qwen3-30B-A3B

Text Generation • 31B • Updated Jul 26, 2025 • 1.74M • 900

liked a dataset about 1 year ago

gaia-benchmark/GAIA

Viewer • Updated Oct 28, 2025 • 932 • 42.2k • 695

upvoted a paper about 1 year ago

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

Paper • 2505.02707 • Published May 5, 2025 • 85

liked a dataset about 1 year ago

nvidia/OpenMathReasoning

Viewer • Updated May 27, 2025 • 5.68M • 17.7k • 465

upvoted a paper about 1 year ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10, 2025 • 153

liked a dataset over 1 year ago

PrimeIntellect/SYNTHETIC-1

Viewer • Updated Feb 21, 2025 • 1.99M • 1.89k • 62

upvoted an article over 1 year ago

Article

The Technology Behind BLOOM Training

stas

•

Jul 14, 2022

• 45

upvoted 2 papers over 1 year ago

The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models

Paper • 2501.09653 • Published Jan 16, 2025 • 12

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 23, 2025 • 41

liked 7 datasets over 1 year ago

z

AI & ML interests

Recent Activity

Organizations

Huye2023's activity

The Technology Behind BLOOM Training