Evangelinejy/Qwen3_0.6B_LanTokenizer_ctx2048_SFT_trajectory_sep_cot_400 0.4B • Updated 3 days ago • 542
Evangelinejy/Qwen25-1_5b-midtrain-openthoughts-nothink-8192-epoch3.0-bs4 2B • Updated 4 days ago • 14
Evangelinejy/llama-32-3b-midtrain-openthoughts-nothink-8192-epoch3.0-bs4 175k • Updated 7 days ago • 53
Evangelinejy/llama-32-3b-instruct-openthoughts-nothink-8192-epoch1.0-bs4 175k • Updated 13 days ago • 58
Evangelinejy/llama-32-3b-midtrain-openthoughts-nothink-8192-epoch1.0-bs4 175k • Updated 13 days ago • 11
Evangelinejy/llama-32-3b-midtrain-openthoughts-think-8192-epoch1.0-bs4 175k • Updated 14 days ago • 109
Evangelinejy/llama-32-3b-instruct-openthoughts-think-8192-epoch1.0-bs4 175k • Updated 14 days ago • 166
Evangelinejy/llama-32-3b-instruct-openthoughts-nothink-bs4-epoch2.0-ctx8192-ga2-lr1e-05-wr0.1-n4 175k • Updated 25 days ago • 10
Evangelinejy/llama-32-3b-instruct-openthoughts-nothink-bs4-epoch1.0-ctx8192-ga2-lr1e-05-wr0.1-n4 175k • Updated 25 days ago • 18
Evangelinejy/llama3b_base_openthoughts_solution_only-bs4-epoch1.0-ctx8192-ga1-lr5e-05-wr0.1-n4 Text Generation • 175k • Updated 30 days ago • 19
Evangelinejy/llama3b_midtrain_openthoughts_solution_only-bs4-epoch1.0-ctx8192-ga1-lr5e-05-wr0.1-n4 Text Generation • 175k • Updated 30 days ago • 58
Evangelinejy/qwen25-7b-prm_demo-bs2-epoch3.0-ctx4096-ga2-lr1e-05-wr0.1-n4 Text Generation • 333k • Updated Dec 20, 2025 • 1
Evangelinejy/llama-32-3b-instruct-open-thoughts114k_math-bs4-epoch1.0-ctx8192-ga2-lr1e-05-wr0.1-n4 175k • Updated Nov 22, 2025 • 98
Evangelinejy/llama3b-instruct-data_sft_50k_leon_nemotron_thinking-bs4-epoch1.0-ctx8192-ga2-lr1e-05-wr0.1-n4 175k • Updated Nov 22, 2025
Evangelinejy/llama3b-base-open-thoughts114k_math-bs4-epoch1.0-ctx8192-ga1-lr1e-05-wr0.1-n4 175k • Updated Nov 15, 2025 • 85
Evangelinejy/llama3b-midtrain-open-thoughts114k_math-bs4-epoch1.0-ctx8192-ga1-lr1e-05-wr0.1-n4 175k • Updated Nov 15, 2025 • 378
Evangelinejy/octothinker-3b-short-base-open-thoughts114k_math-bs4-epoch1.0-ctx8192-ga1-lr1e-05-wr0.1-n4 175k • Updated Nov 15, 2025
Evangelinejy/octothinker-3b-hybrid-base-open-thoughts114k_math-bs4-epoch1.0-ctx8192-ga1-lr1e-05-wr0.1-n4 175k • Updated Nov 15, 2025 • 47
Evangelinejy/llama3b-midtrain-data_sft_50k_leon_nemotron_thinking-bs4-epoch1.0-ctx8192-ga1-lr5e-06-wr0.1-n4 175k • Updated Nov 12, 2025 • 7
Evangelinejy/octothinker-short-data_sft_50k_leon_nemotron_thinking-bs4-epoch1.0-ctx8192-ga1-lr5e-06-wr0.1-n4 175k • Updated Nov 12, 2025
Evangelinejy/octothinker-3b-short-base-data_sft_50k_leon_nemotron-bs4-epoch1.0-ctx4096-ga1-lr1e-05-wr0.1-n4 175k • Updated Nov 12, 2025
Evangelinejy/octothinker-3b-hybrid-base-data_sft_50k_leon_nemotron-bs4-epoch1.0-ctx4096-ga1-lr1e-05-wr0.1-n4 175k • Updated Nov 12, 2025
Evangelinejy/llama3b-midtrain-data_sft_50k_leon_nemotron-bs4-epoch1.0-ctx4096-ga1-lr1e-05-wr0.1-n4 175k • Updated Nov 10, 2025 • 1