A collection of Small Language Models pretrained from scratch (using only PyTorch) on Tiny Stories Dataset on a single Tesla-T4 16GB GPU.
Namrata Thakur
NamrataThakur
AI & ML interests
Small Language Model, Fine-Tuning, From Scratch
Organizations
None yet
models 11
NamrataThakur/Small_Language_Model_MOE_127M_Pretrained
Text Generation • Updated • 2.27k • 3
NamrataThakur/Small_Language_Model_GQA_48M_Pretrained
Text Generation • Updated • 1.35k • 1
NamrataThakur/Small_Language_Model_MHA_53M_Pretrained
Text Generation • Updated • 1.35k • 1
NamrataThakur/llama31-8bn_Reinforcement-Fine-Tuned
Question Answering • 8B • Updated • 32
NamrataThakur/llama31-8bn_SFT
Question Answering • 8B • Updated • 17
NamrataThakur/llama32-1bn_finetuned
Question Answering • 1B • Updated • 10
NamrataThakur/llama32-1bn_RAFT
Question Answering • 1B • Updated • 12
NamrataThakur/GPT2_355M_Perference-Fine-Tune_DPO
Question Answering • Updated
NamrataThakur/llama32-1bn_FederatedLearning_Fine-Tuned_nonQuantized
Question Answering • 1B • Updated • 12
NamrataThakur/llama32-1bn_FederatedLearning_Fine-Tuned
Question Answering • 1B • Updated • 14