·
AI & ML interests
None yet
Organizations
None yet
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch3-cosine0516-v1
Text Generation
• 8B • Updated
• 6
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0515-v2
Text Generation
• 8B • Updated
• 5
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0515-v1
Text Generation
• 8B • Updated
• 7
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0514-v2
Text Generation
• 8B • Updated
• 14
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0514-v1
Text Generation
• 8B • Updated
• 10
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0513-v1
Text Generation
• 8B • Updated
• 1
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0512-v2
Text Generation
• 8B • Updated
• 5
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0512-v1
Text Generation
• 8B • Updated
• 7
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0511-v3
Text Generation
• 8B • Updated
• 12
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch3-cosine0511-v2
Text Generation
• 8B • Updated
• 2
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0510-v1
Text Generation
• 8B • Updated
• 6
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch3-cosine0510-v1
Text Generation
• 8B • Updated
• 1
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0509
Text Generation
• 8B • Updated
• 3
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-0509
Text Generation
• 8B • Updated
• 8
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math3to5-cosine-0507-wRv2
Text Generation
• 8B • Updated
• 1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math3to5-cosine-0507-wR
Text Generation
• 8B • Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math3to5-cosine-0507
Text Generation
• 8B • Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math3to5-cosine-0506
Text Generation
• 8B • Updated
• 1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-0505
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-0502
Text Generation
• 8B • Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-0430
Text Generation
• 8B • Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-0429
Text Generation
• 8B • Updated
• 5
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-noRW-noRP-0428-updatePW
Text Generation
• 8B • Updated
• 1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-0428
Text Generation
• 8B • Updated
• 5
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW-noRP-0427-updatePW
Text Generation
• 3B • Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-noRW-noRP-0427-updatePW
Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-noRW-noRP-0426
Text Generation
• 8B • Updated
• 1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-noRW-noRP-0426-updatePW
Text Generation
• 8B • Updated
• 1