Rzoro
/

checkpoints_3_12

@@ -1,6 +1,4 @@
 ---
-license: mit
-base_model: microsoft/deberta-v3-large
 tags:
 - generated_from_trainer
 model-index:
@@ -13,10 +11,10 @@ should probably proofread and complete it, then remove this comment. -->
 # checkpoints_3_12
-This model is a fine-tuned version of [microsoft/deberta-v3-large](https://huggingface.co/microsoft/deberta-v3-large) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.9977
-- Map@3: 0.7303
 ## Model description
@@ -35,7 +33,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
 - train_batch_size: 1
 - eval_batch_size: 1
 - seed: 0
@@ -50,29 +48,29 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Map@3  |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| 1.6031        | 0.04  | 200  | 1.3487          | 0.6587 |
-| 1.5042        | 0.08  | 400  | 1.2233          | 0.6620 |
-| 1.4276        | 0.13  | 600  | 1.2073          | 0.6857 |
-| 1.3488        | 0.17  | 800  | 1.1651          | 0.6857 |
-| 1.3509        | 0.21  | 1000 | 1.1356          | 0.6982 |
-| 1.3164        | 0.25  | 1200 | 1.1142          | 0.7017 |
-| 1.3237        | 0.29  | 1400 | 1.0661          | 0.7027 |
-| 1.2723        | 0.34  | 1600 | 1.0683          | 0.7225 |
-| 1.2762        | 0.38  | 1800 | 1.0722          | 0.7098 |
-| 1.298         | 0.42  | 2000 | 1.0303          | 0.7215 |
-| 1.256         | 0.46  | 2200 | 1.0969          | 0.7165 |
-| 1.2629        | 0.51  | 2400 | 1.0432          | 0.7193 |
-| 1.2527        | 0.55  | 2600 | 1.0621          | 0.7220 |
-| 1.1922        | 0.59  | 2800 | 1.0131          | 0.7270 |
-| 1.2399        | 0.63  | 3000 | 1.0079          | 0.7297 |
-| 1.2404        | 0.67  | 3200 | 1.0044          | 0.7292 |
-| 1.24          | 0.72  | 3400 | 1.0515          | 0.7263 |
-| 1.216         | 0.76  | 3600 | 1.0160          | 0.7233 |
-| 1.1973        | 0.8   | 3800 | 1.0072          | 0.7323 |
-| 1.2339        | 0.84  | 4000 | 1.0013          | 0.7318 |
-| 1.2114        | 0.88  | 4200 | 1.0025          | 0.7272 |
-| 1.2262        | 0.93  | 4400 | 0.9980          | 0.7297 |
-| 1.2621        | 0.97  | 4600 | 0.9977          | 0.7303 |
 ### Framework versions

 ---
 tags:
 - generated_from_trainer
 model-index:
 # checkpoints_3_12
+This model was trained from scratch on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.0095
+- Map@3: 0.7278
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1e-05
 - train_batch_size: 1
 - eval_batch_size: 1
 - seed: 0
 | Training Loss | Epoch | Step | Validation Loss | Map@3  |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
+| 1.2279        | 0.04  | 200  | 1.0085          | 0.7305 |
+| 1.2068        | 0.08  | 400  | 1.0804          | 0.7313 |
+| 1.1809        | 0.13  | 600  | 1.0225          | 0.7302 |
+| 1.0666        | 0.17  | 800  | 1.2032          | 0.7248 |
+| 1.0671        | 0.21  | 1000 | 1.0308          | 0.7243 |
+| 1.0396        | 0.25  | 1200 | 1.0818          | 0.7183 |
+| 1.0183        | 0.29  | 1400 | 1.1960          | 0.7205 |
+| 0.9193        | 0.34  | 1600 | 1.2615          | 0.7072 |
+| 0.9277        | 0.38  | 1800 | 1.1993          | 0.7230 |
+| 0.9777        | 0.42  | 2000 | 1.2120          | 0.7203 |
+| 0.9021        | 0.46  | 2200 | 1.4372          | 0.7208 |
+| 0.9538        | 0.51  | 2400 | 1.1713          | 0.7167 |
+| 0.9536        | 0.55  | 2600 | 1.2319          | 0.7225 |
+| 0.8825        | 0.59  | 2800 | 1.1445          | 0.7257 |
+| 0.9923        | 0.63  | 3000 | 1.0981          | 0.7195 |
+| 1.0443        | 0.67  | 3200 | 1.0991          | 0.7268 |
+| 1.0926        | 0.72  | 3400 | 1.1384          | 0.7332 |
+| 1.1126        | 0.76  | 3600 | 1.0627          | 0.7287 |
+| 1.1415        | 0.8   | 3800 | 1.0397          | 0.7317 |
+| 1.2765        | 0.84  | 4000 | 1.0114          | 0.7285 |
+| 1.2241        | 0.88  | 4200 | 1.0126          | 0.7295 |
+| 1.2353        | 0.93  | 4400 | 1.0098          | 0.7278 |
+| 1.275         | 0.97  | 4600 | 1.0095          | 0.7278 |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0fbf2de9b66745ab45358a335c8f840d66f4cc90d72f981ddb4eed845e7043c3
 size 1740387701

 version https://git-lfs.github.com/spec/v1
+oid sha256:bc5eeb7dce96ca13d492f301c6b04d7e77e031615aeaf034cb7b7d1c1a517bf8
 size 1740387701