NamrataThakur/Small_Language_Model_GQA_48M_Pretrained Text Generation • Updated 28 days ago • 2.7k • 1
NamrataThakur/Small_Language_Model_MOE_127M_Pretrained Text Generation • Updated 28 days ago • 2.55k • 1