PENNY
Collection
8 items • Updated • 2
base_model: meta-llama/Meta-Llama-3-8B gate_mode: hidden dtype: bfloat16 experts:
| Metric | Value |
|---|---|
| Avg. | 65.13 |
| AI2 Reasoning Challenge (25-Shot) | 62.80 |
| HellaSwag (10-Shot) | 83.60 |
| MMLU (5-Shot) | 65.13 |
| TruthfulQA (0-shot) | 50.41 |
| Winogrande (5-shot) | 77.27 |
| GSM8k (5-shot) | 58.68 |