MBZUAI
/

MobiLlama-05B

Text Generation

text-generation-inference

Model card Files Files and versions

Update README.md

#1

by Ashmal - opened Feb 26, 2024

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

Files changed (1) hide show

README.md +20 -5

README.md CHANGED Viewed

@@ -2,11 +2,13 @@
 license: mit
 license_link: https://huggingface.co/microsoft/phi-2/resolve/main/LICENSE
 language:
-  - en
 pipeline_tag: text-generation
 tags:
-  - nlp
-  - code
 ---
 # MobiLlama-05B
@@ -57,7 +59,20 @@ print(tokenizer.batch_decode(outputs[:, input_ids.shape[1]:-1])[0].strip())
 ```
-## Intended Uses
-Given the nature of the training data, the MobiLlama-05B model is best suited for prompts using the QA format, the chat format, and the code format.

 license: mit
 license_link: https://huggingface.co/microsoft/phi-2/resolve/main/LICENSE
 language:
+- en
 pipeline_tag: text-generation
 tags:
+- nlp
+- code
+datasets:
+- LLM360/AmberDatasets
 ---
 # MobiLlama-05B
 ```
+## Evaluation
+| Evaluation Benchmark | MobiLlama-0.5B | MobiLlama-0.8B | MobiLlama-1.2B |
+| ----------- | ----------- | ----------- |
+| HellaSwag | 0.5252 | 0.5409 | 0.6299 |
+| MMLU | 0.2645 | 0.2692 | 0.2423 |
+| Arc Challenge | 0.2952 | 0.3020 | 0.3455 |
+| TruthfulQA | 0.3805 | 0.3848 | 0.3557 |
+| CrowsPairs | 0.6403 | 0.6482 | 0.6812 |
+| PIQA | 0.7203 | 0.7317 | 0.7529 |
+| Race | 0.3368 | 0.3337 | 0.3531 |
+| SIQA | 0.4022 | 0.4160 | 0.4196 |
+| Winogrande | 0.5753 | 0.5745 | 0.6108 |
+## Intended Uses
+Given the nature of the training data, the MobiLlama-05B model is best suited for prompts using the QA format, the chat format, and the code format.