LLama3-2-1B-distortion-fold-1-1a-v1
This model is a fine-tuned version of meta-llama/Llama-3.2-1B on an unknown dataset. It achieves the following results on the evaluation set:
- Loss: 6.2378
- Accuracy: 0.3656
- Precision Macro: 0.4115
- Recall Macro: 0.3206
- F1 Macro: 0.3231
- Precision Weighted: 0.4038
- Recall Weighted: 0.3656
- F1 Weighted: 0.3464
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 16
- eval_batch_size: 16
- seed: 42
- optimizer: Use adamw_torch_fused with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
- lr_scheduler_type: linear
- lr_scheduler_warmup_ratio: 0.06
- num_epochs: 30
- mixed_precision_training: Native AMP
Training results
| Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision Macro | Recall Macro | F1 Macro | Precision Weighted | Recall Weighted | F1 Weighted |
|---|---|---|---|---|---|---|---|---|---|---|
| 2.6325 | 1.0 | 60 | 2.1900 | 0.2656 | 0.3628 | 0.2534 | 0.2179 | 0.3835 | 0.2656 | 0.2381 |
| 1.5559 | 2.0 | 120 | 2.0151 | 0.3469 | 0.3820 | 0.3303 | 0.3174 | 0.4140 | 0.3469 | 0.3385 |
| 0.8601 | 3.0 | 180 | 3.2896 | 0.3312 | 0.3563 | 0.3159 | 0.3102 | 0.3758 | 0.3312 | 0.3305 |
| 0.2825 | 4.0 | 240 | 4.5494 | 0.375 | 0.4280 | 0.3565 | 0.3506 | 0.4264 | 0.375 | 0.3681 |
| 0.2122 | 5.0 | 300 | 4.6586 | 0.3375 | 0.3757 | 0.3191 | 0.3110 | 0.3753 | 0.3375 | 0.3159 |
| 0.1402 | 6.0 | 360 | 5.4650 | 0.3625 | 0.4218 | 0.3343 | 0.3321 | 0.4210 | 0.3625 | 0.3509 |
| 0.0657 | 7.0 | 420 | 6.2378 | 0.3656 | 0.4115 | 0.3206 | 0.3231 | 0.4038 | 0.3656 | 0.3464 |
Framework versions
- Transformers 4.57.1
- Pytorch 2.9.1+cu128
- Datasets 4.4.1
- Tokenizers 0.22.1
- Downloads last month
- 1
Model tree for Kudod/LLama3-2-1B-distortion-fold-1-1a-v1
Base model
meta-llama/Llama-3.2-1B