pansophic
/

rocket-3B

Text Generation

Model card Files Files and versions

pansophic commited on Nov 23, 2023

Commit

6b65196

·

1 Parent(s): a5ba4ce

Update benchmarks in README

Files changed (1) hide show

README.md +5 -4

README.md CHANGED Viewed

@@ -67,13 +67,14 @@ In AlpacaEval, Rocket 🦝 achieves a near 80% win rate, coupled with an average
 | Metric                | Value                     |
 |-----------------------|---------------------------|
 | ARC (25-shot)         | 50.51          |
-| HellaSwag (0-shot)   | 73.91    |
-| TruthfulQA (mc2) (0-shot)   | 54.38   |
-| BoolQ (0-shot)        | 81.71        |
 | Winogrande (5-shot)   | 67.8   |
 | GSM8K (5-shot)        | 37.91        |
-| MathQA (5-shot)        | 31.26        |
 ## Intended uses & limitations

 | Metric                | Value                     |
 |-----------------------|---------------------------|
+| Average               | 51.00             |
 | ARC (25-shot)         | 50.51          |
+| HellaSwag (10-shot)   | 76.45    |
+| MMLU (5-shot)        | 45.51        |
+| TruthfulQA (0-shot)   | 54.38   |
 | Winogrande (5-shot)   | 67.8   |
 | GSM8K (5-shot)        | 37.91        |
+| DROP (3-shot)        | 24.49        |
 ## Intended uses & limitations