A collection of efficient language models for edge deployment. The models use a Mixture-of-Experts (MoE) architecture that activates only 25% of their parameters.
Faria Sultana
fariasultana
Recent Activity
liked a dataset about 10 hours ago: fariasultana/TrickGPT
updated a dataset about 10 hours ago: fariasultana/TrickGPT
published a dataset about 12 hours ago: fariasultana/TrickGPT