AI & ML interests
Large Language Models, Distributed Training and Inference
Published an article over 1 year ago: Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2
Published an article over 1 year ago: Saving Memory Using Padding-Free Transformer Layers during Finetuning
Published an article almost 2 years ago: Aurora-M: The First Open Source Biden-Harris Executive Order Red teamed Multilingual Language Model