quancute/qwen3_06b_grpo_multievalvietsum_penalty_in_domain Feature Extraction • 0.6B • Updated 29 days ago • 18
quancute/DPOLlama-3.2-1B-Instruct_sum-chosen5_reject_less2-5k_22Mar-2025_A100 1B • Updated Mar 23, 2025
quancute/DPOLlama-3.2-1B-Instruct_sum-chosen5_reject_greater3-20k_22Mar-2025_A100 1B • Updated Mar 23, 2025 • 1
quancute/DPOLlama-3.2-1B-Instruct_sum-39k_12Mar-2025_A100_new Text Generation • 1B • Updated Mar 13, 2025 • 1 •
quancute/DPOLlama-3.2-1B-Instruct_sum-39k_8Mar-2025_A100 Text Generation • 1B • Updated Mar 11, 2025 •
quancute/Llama-3.2-1B-Instruct_sum-10k_2Mar-2025_A100 Text Generation • 1B • Updated Mar 3, 2025 • 2 •