AI & ML interests
Large Language Models
Organizations
None yet
tanaymehta/gpt2_900K_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_800K_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_700K_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_600K_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_500K_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_400K_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_300K_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_200K_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_100K_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_1M_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_1_1M_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_1_2M_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_1_3M_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_1_4M_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_1_5M_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_1_6M_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_1_7M_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_1_8M_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_1_9M_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_2M_100eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_2H_12L_1M_tok_50_eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_4H_12L_1M_tok_50_eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_6H_12L_1M_tok_50_eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_8H_12L_1M_tok_50_eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_12H_12L_5M_tok_100_eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_12H_12L_10M_tok_100_eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_12H_12L_15M_tok_100_eps
Text Generation
• 0.1B • Updated
• 1
tanaymehta/gpt2_12H_2L_75M_tok_20_eps
Text Generation
• 53.6M • Updated
tanaymehta/gpt2_12H_4L_75M_tok_20_eps
Text Generation
• 67.7M • Updated
tanaymehta/gpt2_12H_6L_75M_tok_20_eps
Text Generation
• 81.9M • Updated
• 2