This collection contains my GPT-3 Small implementations. All models here share same architecture and are same model on different training stages.
Kyryll Kochkin
k050506koch
AI & ML interests
LLMs. I am obsessed with them. Worked with dense models so far, and plan to experiment with MoEs.
Recent Activity
updated a model about 1 month ago
k050506koch/GPT3-dev-125m-1005 published a model about 1 month ago
k050506koch/GPT3-dev-125m-1005 updated a Space 2 months ago
k050506koch/gpt3-dev-apiOrganizations
None yet