quancute/qwen3_06b_grpo_multievalvietsum_penalty_in_domain Feature Extraction • 0.6B • Updated 29 days ago • 18
quancute/grpo_qwen3_0_6b_nopenalty_in_domain Feature Extraction • 0.6B • Updated about 1 month ago • 6