selfcorrexp/llama3_it_8b_tmp07_n3
• Updated • 15k • 3
selfcorrexp/llama3_it_8b_tmp10_n3
• Updated • 15k • 3
selfcorrexp/llama3_regular_balanced_sft_4_ORM_training
• Updated • 174k • 3
selfcorrexp/llama3_regular_NON_balanced_sft_4_ORM_training
• Updated • 327k • 3
selfcorrexp/llama3_non_delete_4_ORM_training
• Updated • 191k • 3
selfcorrexp/llama3_regular_balanced_sft_chat_format
• Updated • 174k • 4
selfcorrexp/llama3_additional_rr40k_non_delete_sft_chat_format
• Updated • 231k • 4
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_augmath_2
• Updated • 25.5k • 1
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_augmath_1
• Updated • 25.5k • 3
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_math_2
• Updated • 21.4k • 2
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_math_1
• Updated • 21.4k • 3
selfcorrexp/llama31_prompt_first_corr_math1
• Updated • 60k • 3
selfcorrexp/llama31_prompt_first_wrong_math2
• Updated • 118k • 4
selfcorrexp/llama31_prompt_first_wrong_math1
• Updated • 110k • 4
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_math_2nd_round_prompt
• Updated • 21.4k • 4
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_augmath_2nd_round_prompt
• Updated • 25.5k • 3
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_math_base
• Updated • 7.01k • 3
selfcorrexp/baseline_star_rr80k
• Updated • 257k • 3
selfcorrexp/baseline_star
• Updated • 176k • 3
selfcorrexp/llama3_non_delete_rr40k_3ep_dpo_gen_augmath_base
• Updated • 15.1k • 3
selfcorrexp/llama3_additional_rr40k_non_delete_sft
• Updated • 231k • 3
selfcorrexp/llama3_non_delete_regular_balanced_sft
• Updated • 191k • 3
selfcorrexp/llama3_additional_rr80k_NON_balanced_sft
• Updated • 407k • 2
selfcorrexp/llama31_prompt_first_wrong_prompt2
• Updated • 60.4k • 3
selfcorrexp/llama31_prompt_first_wrong_prompt1
• Updated • 60k • 3
selfcorrexp/llama31_prompt_corr_prompt
• Updated • 60k • 3
selfcorrexp/llama3_non_balanced_rr10k_2e6_bz32_ep3tmp07
• Updated • 15k • 3
selfcorrexp/llama3_non_balanced_rr10k_2e6_bz32_ep3tmp10
• Updated • 15k • 3
selfcorrexp/llama3_v2_rlhflow_math2
• Updated • 7.5k • 3