SFT Final Models
Collection
Models that were trained on clembench v0.9 - v1.6 • 4 items • Updated
This model is a fine-tuned version of unsloth/meta-llama-3.1-8b-instruct-bnb-4bit on the None dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.2589 | 0.0565 | 100 | 0.3587 |
| 0.1837 | 0.1130 | 200 | 0.2943 |
| 0.1982 | 0.1695 | 300 | 0.2688 |
| 0.158 | 0.2260 | 400 | 0.2513 |
| 0.1527 | 0.2825 | 500 | 0.2402 |
| 0.147 | 0.3390 | 600 | 0.2392 |
| 0.1022 | 0.3955 | 700 | 0.2372 |