xayysomw_20250703_033117
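This model is a fine-tuned version of meta-llama/Llama-3.2-1B on an unknown dataset. It achieves the following results on the evaluation set (the values match the step 12200 row of the training results table, which has the highest Move Accuracy logged):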
- Loss: 0.6087
- Model Preparation Time: 0.0078
- Move Accuracy: 0.2450
- Token Accuracy: 0.7645
- Accuracy: 0.2450
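For readers who prefer perplexity to raw loss, the conversion can be sketched as follows. This assumes the reported loss is a mean per-token cross-entropy in nats, which the card does not state explicitly.

```python
import math

# Assumption: "Loss" above is mean per-token cross-entropy measured in nats.
eval_loss = 0.6087

# Perplexity is the exponential of the mean cross-entropy.
perplexity = math.exp(eval_loss)
print(f"perplexity ~ {perplexity:.3f}")
```

Under that assumption the evaluation loss corresponds to a perplexity of roughly 1.84.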
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 0.0003
- train_batch_size: 128
- eval_batch_size: 256
- seed: 42
- optimizer: AdamW (OptimizerNames.ADAMW_TORCH) with betas=(0.9,0.999) and epsilon=1e-08; no additional optimizer arguments
- lr_scheduler_type: constant_with_warmup
- lr_scheduler_warmup_ratio: 0.001
- num_epochs: 100
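To make the `constant_with_warmup` setting concrete, here is a minimal pure-Python sketch of the learning-rate curve these hyperparameters imply. The steps-per-epoch figure is an assumption estimated from the training results table (step 10200 is logged at epoch 1.0014), and `lr_at` is a hypothetical helper, not the Trainer's own code.

```python
import math

# Assumption: about 10,186 optimizer steps per epoch, estimated from the
# results table (step 10200 is logged at epoch 1.0014).
STEPS_PER_EPOCH = 10_186
NUM_EPOCHS = 100        # num_epochs
WARMUP_RATIO = 0.001    # lr_scheduler_warmup_ratio
BASE_LR = 3e-4          # learning_rate

total_steps = STEPS_PER_EPOCH * NUM_EPOCHS
warmup_steps = math.ceil(total_steps * WARMUP_RATIO)


def lr_at(step: int) -> float:
    """Linear warmup from 0 to BASE_LR, then hold BASE_LR constant."""
    if step < warmup_steps:
        return BASE_LR * step / max(1, warmup_steps)
    return BASE_LR


for step in (0, 100, 500, warmup_steps, 10_000):
    print(step, f"{lr_at(step):.6f}")
```

With this estimate the warmup spans roughly the first thousand steps, after which the rate stays at 0.0003 with no decay.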
Training results
| Training Loss | Epoch | Step | Validation Loss | Model Preparation Time | Move Accuracy | Token Accuracy | Accuracy |
|---|---|---|---|---|---|---|---|
| No log | 0 | 0 | 11.9474 | 0.0078 | 0.0 | 0.0000 | 0.0 |
| 2.6223 | 0.0098 | 100 | 2.5852 | 0.0078 | 0.0 | 0.2975 | 0.0 |
| 1.6292 | 0.0196 | 200 | 1.6152 | 0.0078 | 0.0022 | 0.3807 | 0.0022 |
| 1.4561 | 0.0295 | 300 | 1.4943 | 0.0078 | 0.0082 | 0.4221 | 0.0082 |
| 1.4191 | 0.0393 | 400 | 1.4635 | 0.0078 | 0.0031 | 0.4376 | 0.0031 |
| 1.4016 | 0.0491 | 500 | 1.4442 | 0.0078 | 0.0034 | 0.4358 | 0.0034 |
| 1.3329 | 0.0589 | 600 | 1.3917 | 0.0078 | 0.0088 | 0.4639 | 0.0088 |
| 1.3572 | 0.0687 | 700 | 1.3693 | 0.0078 | 0.0129 | 0.4759 | 0.0129 |
| 1.3993 | 0.0785 | 800 | 1.3327 | 0.0078 | 0.0200 | 0.4888 | 0.0200 |
| 1.302 | 0.0884 | 900 | 1.2935 | 0.0078 | 0.0322 | 0.5054 | 0.0322 |
| 1.3238 | 0.0982 | 1000 | 1.2859 | 0.0078 | 0.0288 | 0.5078 | 0.0288 |
| 1.2399 | 0.1080 | 1100 | 1.2559 | 0.0078 | 0.0376 | 0.5176 | 0.0376 |
| 1.1942 | 0.1178 | 1200 | 1.2282 | 0.0078 | 0.0444 | 0.5295 | 0.0444 |
| 1.208 | 0.1276 | 1300 | 1.2138 | 0.0078 | 0.0429 | 0.5369 | 0.0429 |
| 1.2147 | 0.1374 | 1400 | 1.1758 | 0.0078 | 0.0532 | 0.5516 | 0.0532 |
| 1.1029 | 0.1473 | 1500 | 1.1540 | 0.0078 | 0.0545 | 0.5586 | 0.0545 |
| 1.0699 | 0.1571 | 1600 | 1.1027 | 0.0078 | 0.0636 | 0.5770 | 0.0636 |
| 1.1063 | 0.1669 | 1700 | 1.0957 | 0.0078 | 0.0603 | 0.5808 | 0.0603 |
| 1.094 | 0.1767 | 1800 | 1.0698 | 0.0078 | 0.0662 | 0.5889 | 0.0662 |
| 0.9785 | 0.1865 | 1900 | 1.0385 | 0.0078 | 0.0727 | 0.6004 | 0.0727 |
| 1.012 | 0.1963 | 2000 | 1.0164 | 0.0078 | 0.0748 | 0.6100 | 0.0748 |
| 0.9995 | 0.2062 | 2100 | 1.0061 | 0.0078 | 0.0788 | 0.6138 | 0.0788 |
| 0.98 | 0.2160 | 2200 | 0.9693 | 0.0078 | 0.0934 | 0.6293 | 0.0934 |
| 0.9847 | 0.2258 | 2300 | 0.9623 | 0.0078 | 0.0935 | 0.6350 | 0.0935 |
| 0.8477 | 0.2356 | 2400 | 0.9403 | 0.0078 | 0.1006 | 0.6413 | 0.1006 |
| 0.9014 | 0.2454 | 2500 | 0.9165 | 0.0078 | 0.1069 | 0.6493 | 0.1069 |
| 0.8881 | 0.2553 | 2600 | 0.9062 | 0.0078 | 0.1118 | 0.6548 | 0.1118 |
| 0.9052 | 0.2651 | 2700 | 0.8721 | 0.0078 | 0.1257 | 0.6688 | 0.1257 |
| 0.8653 | 0.2749 | 2800 | 0.8700 | 0.0078 | 0.1227 | 0.6690 | 0.1227 |
| 0.8361 | 0.2847 | 2900 | 0.8592 | 0.0078 | 0.1234 | 0.6721 | 0.1234 |
| 0.8225 | 0.2945 | 3000 | 0.8568 | 0.0078 | 0.1309 | 0.6738 | 0.1309 |
| 0.815 | 0.3043 | 3100 | 0.8286 | 0.0078 | 0.1411 | 0.6871 | 0.1411 |
| 0.778 | 0.3142 | 3200 | 0.8281 | 0.0078 | 0.1392 | 0.6878 | 0.1392 |
| 0.826 | 0.3240 | 3300 | 0.8032 | 0.0078 | 0.1491 | 0.6950 | 0.1491 |
| 0.76 | 0.3338 | 3400 | 0.8057 | 0.0078 | 0.1437 | 0.6943 | 0.1437 |
| 0.7552 | 0.3436 | 3500 | 0.8020 | 0.0078 | 0.1479 | 0.6967 | 0.1479 |
| 0.822 | 0.3534 | 3600 | 0.7959 | 0.0078 | 0.1495 | 0.7002 | 0.1495 |
| 0.7419 | 0.3632 | 3700 | 0.7866 | 0.0078 | 0.1544 | 0.6995 | 0.1544 |
| 0.7857 | 0.3731 | 3800 | 0.7788 | 0.0078 | 0.1604 | 0.7060 | 0.1604 |
| 0.7362 | 0.3829 | 3900 | 0.7693 | 0.0078 | 0.1675 | 0.7096 | 0.1675 |
| 0.7858 | 0.3927 | 4000 | 0.7651 | 0.0078 | 0.1694 | 0.7091 | 0.1694 |
| 0.7668 | 0.4025 | 4100 | 0.7603 | 0.0078 | 0.1652 | 0.7115 | 0.1652 |
| 0.7137 | 0.4123 | 4200 | 0.7684 | 0.0078 | 0.1646 | 0.7094 | 0.1646 |
| 0.7357 | 0.4221 | 4300 | 0.7546 | 0.0078 | 0.1686 | 0.7125 | 0.1686 |
| 0.774 | 0.4320 | 4400 | 0.7456 | 0.0078 | 0.1744 | 0.7151 | 0.1744 |
| 0.7199 | 0.4418 | 4500 | 0.7300 | 0.0078 | 0.1801 | 0.7219 | 0.1801 |
| 0.709 | 0.4516 | 4600 | 0.7255 | 0.0078 | 0.1818 | 0.7228 | 0.1818 |
| 0.7138 | 0.4614 | 4700 | 0.7334 | 0.0078 | 0.1764 | 0.7196 | 0.1764 |
| 0.7234 | 0.4712 | 4800 | 0.7086 | 0.0078 | 0.1873 | 0.7290 | 0.1873 |
| 0.6689 | 0.4811 | 4900 | 0.7173 | 0.0078 | 0.1952 | 0.7280 | 0.1952 |
| 0.7196 | 0.4909 | 5000 | 0.7125 | 0.0078 | 0.1898 | 0.7264 | 0.1898 |
| 0.724 | 0.5007 | 5100 | 0.7079 | 0.0078 | 0.1892 | 0.7290 | 0.1892 |
| 0.6905 | 0.5105 | 5200 | 0.7050 | 0.0078 | 0.1911 | 0.7319 | 0.1911 |
| 0.7294 | 0.5203 | 5300 | 0.6911 | 0.0078 | 0.1949 | 0.7338 | 0.1949 |
| 0.6694 | 0.5301 | 5400 | 0.6894 | 0.0078 | 0.1969 | 0.7362 | 0.1969 |
| 0.6852 | 0.5400 | 5500 | 0.6893 | 0.0078 | 0.1944 | 0.7354 | 0.1944 |
| 0.7001 | 0.5498 | 5600 | 0.6922 | 0.0078 | 0.1996 | 0.7343 | 0.1996 |
| 0.6049 | 0.5596 | 5700 | 0.6868 | 0.0078 | 0.2011 | 0.7359 | 0.2011 |
| 0.6958 | 0.5694 | 5800 | 0.6844 | 0.0078 | 0.2000 | 0.7371 | 0.2000 |
| 0.7016 | 0.5792 | 5900 | 0.6805 | 0.0078 | 0.2013 | 0.7389 | 0.2013 |
| 0.6737 | 0.5890 | 6000 | 0.6773 | 0.0078 | 0.1978 | 0.7394 | 0.1978 |
| 0.6724 | 0.5989 | 6100 | 0.6719 | 0.0078 | 0.2049 | 0.7423 | 0.2049 |
| 0.6649 | 0.6087 | 6200 | 0.6684 | 0.0078 | 0.2085 | 0.7449 | 0.2085 |
| 0.6484 | 0.6185 | 6300 | 0.6670 | 0.0078 | 0.2161 | 0.7462 | 0.2161 |
| 0.5971 | 0.6283 | 6400 | 0.6601 | 0.0078 | 0.2120 | 0.7452 | 0.2120 |
| 0.7053 | 0.6381 | 6500 | 0.6658 | 0.0078 | 0.2118 | 0.7441 | 0.2118 |
| 0.7036 | 0.6479 | 6600 | 0.6635 | 0.0078 | 0.2088 | 0.7458 | 0.2088 |
| 0.6133 | 0.6578 | 6700 | 0.6626 | 0.0078 | 0.2170 | 0.7453 | 0.2170 |
| 0.6729 | 0.6676 | 6800 | 0.6560 | 0.0078 | 0.2158 | 0.7466 | 0.2158 |
| 0.5972 | 0.6774 | 6900 | 0.6538 | 0.0078 | 0.2113 | 0.7476 | 0.2113 |
| 0.5957 | 0.6872 | 7000 | 0.6530 | 0.0078 | 0.2154 | 0.7495 | 0.2154 |
| 0.6481 | 0.6970 | 7100 | 0.6428 | 0.0078 | 0.2210 | 0.7513 | 0.2210 |
| 0.5845 | 0.7069 | 7200 | 0.6461 | 0.0078 | 0.2146 | 0.7500 | 0.2146 |
| 0.5844 | 0.7167 | 7300 | 0.6470 | 0.0078 | 0.2252 | 0.7512 | 0.2252 |
| 0.6055 | 0.7265 | 7400 | 0.6429 | 0.0078 | 0.2212 | 0.7521 | 0.2212 |
| 0.5889 | 0.7363 | 7500 | 0.6364 | 0.0078 | 0.2303 | 0.7555 | 0.2303 |
| 0.6503 | 0.7461 | 7600 | 0.6409 | 0.0078 | 0.2231 | 0.7539 | 0.2231 |
| 0.6798 | 0.7559 | 7700 | 0.6441 | 0.0078 | 0.2241 | 0.7538 | 0.2241 |
| 0.6232 | 0.7658 | 7800 | 0.6377 | 0.0078 | 0.2274 | 0.7538 | 0.2274 |
| 0.5737 | 0.7756 | 7900 | 0.6309 | 0.0078 | 0.2301 | 0.7573 | 0.2301 |
| 0.6126 | 0.7854 | 8000 | 0.6290 | 0.0078 | 0.2315 | 0.7583 | 0.2315 |
| 0.5827 | 0.7952 | 8100 | 0.6388 | 0.0078 | 0.2290 | 0.7554 | 0.2290 |
| 0.6219 | 0.8050 | 8200 | 0.6339 | 0.0078 | 0.2229 | 0.7545 | 0.2229 |
| 0.5511 | 0.8148 | 8300 | 0.6330 | 0.0078 | 0.2305 | 0.7580 | 0.2305 |
| 0.6327 | 0.8247 | 8400 | 0.6267 | 0.0078 | 0.2364 | 0.7580 | 0.2364 |
| 0.5984 | 0.8345 | 8500 | 0.6201 | 0.0078 | 0.2328 | 0.7599 | 0.2328 |
| 0.6267 | 0.8443 | 8600 | 0.6317 | 0.0078 | 0.2259 | 0.7564 | 0.2259 |
| 0.6336 | 0.8541 | 8700 | 0.6194 | 0.0078 | 0.2329 | 0.7612 | 0.2329 |
| 0.6098 | 0.8639 | 8800 | 0.6232 | 0.0078 | 0.2337 | 0.7594 | 0.2337 |
| 0.6379 | 0.8737 | 8900 | 0.6266 | 0.0078 | 0.2328 | 0.7579 | 0.2328 |
| 0.6155 | 0.8836 | 9000 | 0.6191 | 0.0078 | 0.2338 | 0.7611 | 0.2338 |
| 0.5573 | 0.8934 | 9100 | 0.6209 | 0.0078 | 0.2353 | 0.7614 | 0.2353 |
| 0.5894 | 0.9032 | 9200 | 0.6221 | 0.0078 | 0.2276 | 0.7592 | 0.2276 |
| 0.633 | 0.9130 | 9300 | 0.6215 | 0.0078 | 0.2292 | 0.7576 | 0.2292 |
| 0.5393 | 0.9228 | 9400 | 0.6174 | 0.0078 | 0.2366 | 0.7618 | 0.2366 |
| 0.5929 | 0.9327 | 9500 | 0.6130 | 0.0078 | 0.2420 | 0.7644 | 0.2420 |
| 0.5917 | 0.9425 | 9600 | 0.6249 | 0.0078 | 0.2282 | 0.7582 | 0.2282 |
| 0.6693 | 0.9523 | 9700 | 0.6117 | 0.0078 | 0.2340 | 0.7629 | 0.2340 |
| 0.6209 | 0.9621 | 9800 | 0.6200 | 0.0078 | 0.2354 | 0.7611 | 0.2354 |
| 0.629 | 0.9719 | 9900 | 0.6204 | 0.0078 | 0.2291 | 0.7578 | 0.2291 |
| 0.6288 | 0.9817 | 10000 | 0.6101 | 0.0078 | 0.2338 | 0.7625 | 0.2338 |
| 0.6165 | 0.9916 | 10100 | 0.6198 | 0.0078 | 0.2351 | 0.7614 | 0.2351 |
| 0.5815 | 1.0014 | 10200 | 0.6128 | 0.0078 | 0.2394 | 0.7631 | 0.2394 |
| 0.6537 | 1.0112 | 10300 | 0.6353 | 0.0078 | 0.2232 | 0.7546 | 0.2232 |
| 0.6097 | 1.0210 | 10400 | 0.6157 | 0.0078 | 0.2360 | 0.7633 | 0.2360 |
| 0.5934 | 1.0308 | 10500 | 0.6114 | 0.0078 | 0.2427 | 0.7636 | 0.2427 |
| 0.56 | 1.0406 | 10600 | 0.6125 | 0.0078 | 0.2362 | 0.7628 | 0.2362 |
| 0.5701 | 1.0505 | 10700 | 0.6033 | 0.0078 | 0.2431 | 0.7672 | 0.2431 |
| 0.6163 | 1.0603 | 10800 | 0.6080 | 0.0078 | 0.2429 | 0.7640 | 0.2429 |
| 0.595 | 1.0701 | 10900 | 0.6040 | 0.0078 | 0.2409 | 0.7654 | 0.2409 |
| 0.6046 | 1.0799 | 11000 | 0.6121 | 0.0078 | 0.2401 | 0.7637 | 0.2401 |
| 0.6036 | 1.0897 | 11100 | 0.6099 | 0.0078 | 0.2425 | 0.7644 | 0.2425 |
| 0.6374 | 1.0995 | 11200 | 0.6000 | 0.0078 | 0.2445 | 0.7680 | 0.2445 |
| 0.5985 | 1.1094 | 11300 | 0.6086 | 0.0078 | 0.2386 | 0.7642 | 0.2386 |
| 0.5989 | 1.1192 | 11400 | 0.6023 | 0.0078 | 0.2426 | 0.7664 | 0.2426 |
| 0.5679 | 1.1290 | 11500 | 0.6018 | 0.0078 | 0.2402 | 0.7656 | 0.2402 |
| 0.6057 | 1.1388 | 11600 | 0.6111 | 0.0078 | 0.2442 | 0.7627 | 0.2442 |
| 0.6422 | 1.1486 | 11700 | 0.6114 | 0.0078 | 0.2361 | 0.7623 | 0.2361 |
| 0.6049 | 1.1585 | 11800 | 0.6020 | 0.0078 | 0.2442 | 0.7665 | 0.2442 |
| 0.6354 | 1.1683 | 11900 | 0.6067 | 0.0078 | 0.2368 | 0.7648 | 0.2368 |
| 0.572 | 1.1781 | 12000 | 0.6110 | 0.0078 | 0.2409 | 0.7646 | 0.2409 |
| 0.5778 | 1.1879 | 12100 | 0.6146 | 0.0078 | 0.2337 | 0.7630 | 0.2337 |
| 0.5793 | 1.1977 | 12200 | 0.6087 | 0.0078 | 0.2450 | 0.7645 | 0.2450 |
| 0.6172 | 1.2075 | 12300 | 0.6258 | 0.0078 | 0.2307 | 0.7582 | 0.2307 |
| 0.5875 | 1.2174 | 12400 | 0.6162 | 0.0078 | 0.2331 | 0.7625 | 0.2331 |
| 0.6579 | 1.2272 | 12500 | 0.6212 | 0.0078 | 0.2232 | 0.7589 | 0.2232 |
| 0.5902 | 1.2370 | 12600 | 0.6172 | 0.0078 | 0.2255 | 0.7600 | 0.2255 |
| 0.6025 | 1.2468 | 12700 | 0.6121 | 0.0078 | 0.2374 | 0.7639 | 0.2374 |
| 0.6108 | 1.2566 | 12800 | 0.6052 | 0.0078 | 0.2367 | 0.7659 | 0.2367 |
| 0.5819 | 1.2664 | 12900 | 0.6055 | 0.0078 | 0.2336 | 0.7645 | 0.2336 |
| 0.6334 | 1.2763 | 13000 | 0.6375 | 0.0078 | 0.2201 | 0.7532 | 0.2201 |
| 0.6051 | 1.2861 | 13100 | 0.6075 | 0.0078 | 0.2346 | 0.7646 | 0.2346 |
| 0.6465 | 1.2959 | 13200 | 0.6244 | 0.0078 | 0.2349 | 0.7581 | 0.2349 |
| 0.59 | 1.3057 | 13300 | 0.6050 | 0.0078 | 0.2349 | 0.7639 | 0.2349 |
| 0.5533 | 1.3155 | 13400 | 0.6116 | 0.0078 | 0.2375 | 0.7625 | 0.2375 |
| 0.5665 | 1.3253 | 13500 | 0.6146 | 0.0078 | 0.2300 | 0.7611 | 0.2300 |
| 0.6111 | 1.3352 | 13600 | 0.6183 | 0.0078 | 0.2298 | 0.7598 | 0.2298 |
| 0.6179 | 1.3450 | 13700 | 0.6167 | 0.0078 | 0.2347 | 0.7610 | 0.2347 |
| 0.6473 | 1.3548 | 13800 | 0.6239 | 0.0078 | 0.2335 | 0.7589 | 0.2335 |
| 0.5681 | 1.3646 | 13900 | 0.6127 | 0.0078 | 0.2380 | 0.7624 | 0.2380 |
| 0.6221 | 1.3744 | 14000 | 0.6158 | 0.0078 | 0.2284 | 0.7611 | 0.2284 |
| 0.6095 | 1.3843 | 14100 | 0.6195 | 0.0078 | 0.2275 | 0.7609 | 0.2275 |
| 0.5977 | 1.3941 | 14200 | 0.6019 | 0.0078 | 0.2369 | 0.7669 | 0.2369 |
| 0.6232 | 1.4039 | 14300 | 0.6157 | 0.0078 | 0.2345 | 0.7627 | 0.2345 |
| 0.5975 | 1.4137 | 14400 | 0.6131 | 0.0078 | 0.2339 | 0.7622 | 0.2339 |
| 0.639 | 1.4235 | 14500 | 0.6147 | 0.0078 | 0.2328 | 0.7613 | 0.2328 |
| 0.6194 | 1.4333 | 14600 | 0.6215 | 0.0078 | 0.2212 | 0.7592 | 0.2212 |
| 0.559 | 1.4432 | 14700 | 0.6107 | 0.0078 | 0.2325 | 0.7634 | 0.2325 |
| 0.6177 | 1.4530 | 14800 | 0.6287 | 0.0078 | 0.2265 | 0.7566 | 0.2265 |
| 0.6685 | 1.4628 | 14900 | 0.6264 | 0.0078 | 0.2210 | 0.7584 | 0.2210 |
| 0.6171 | 1.4726 | 15000 | 0.6321 | 0.0078 | 0.2210 | 0.7575 | 0.2210 |
| 0.6483 | 1.4824 | 15100 | 0.6298 | 0.0078 | 0.2276 | 0.7591 | 0.2276 |
| 0.628 | 1.4922 | 15200 | 0.6281 | 0.0078 | 0.2217 | 0.7554 | 0.2217 |
| 0.5978 | 1.5021 | 15300 | 0.6383 | 0.0078 | 0.2230 | 0.7536 | 0.2230 |
| 0.5902 | 1.5119 | 15400 | 0.6301 | 0.0078 | 0.2257 | 0.7549 | 0.2257 |
| 0.6467 | 1.5217 | 15500 | 0.6355 | 0.0078 | 0.2262 | 0.7533 | 0.2262 |
| 0.5873 | 1.5315 | 15600 | 0.6314 | 0.0078 | 0.2221 | 0.7552 | 0.2221 |
| 0.6911 | 1.5413 | 15700 | 0.6322 | 0.0078 | 0.2244 | 0.7568 | 0.2244 |
| 0.6346 | 1.5511 | 15800 | 0.6509 | 0.0078 | 0.2110 | 0.7492 | 0.2110 |
| 0.6008 | 1.5610 | 15900 | 0.6433 | 0.0078 | 0.2139 | 0.7509 | 0.2139 |
| 0.6435 | 1.5708 | 16000 | 0.6473 | 0.0078 | 0.2176 | 0.7506 | 0.2176 |
| 0.6599 | 1.5806 | 16100 | 0.6534 | 0.0078 | 0.2112 | 0.7487 | 0.2112 |
| 0.6332 | 1.5904 | 16200 | 0.6328 | 0.0078 | 0.2138 | 0.7531 | 0.2138 |
| 0.6952 | 1.6002 | 16300 | 0.6423 | 0.0078 | 0.2186 | 0.7528 | 0.2186 |
| 0.6927 | 1.6101 | 16400 | 0.6365 | 0.0078 | 0.2141 | 0.7540 | 0.2141 |
| 0.6372 | 1.6199 | 16500 | 0.6292 | 0.0078 | 0.2175 | 0.7537 | 0.2175 |
| 0.6765 | 1.6297 | 16600 | 0.6570 | 0.0078 | 0.2052 | 0.7471 | 0.2052 |
| 0.6411 | 1.6395 | 16700 | 0.6427 | 0.0078 | 0.2173 | 0.7522 | 0.2173 |
| 0.694 | 1.6493 | 16800 | 0.7134 | 0.0078 | 0.1862 | 0.7281 | 0.1862 |
| 0.634 | 1.6591 | 16900 | 0.6267 | 0.0078 | 0.2195 | 0.7563 | 0.2195 |
| 0.5748 | 1.6690 | 17000 | 0.6175 | 0.0078 | 0.2328 | 0.7599 | 0.2328 |
| 0.6457 | 1.6788 | 17100 | 0.6218 | 0.0078 | 0.2299 | 0.7570 | 0.2299 |
| 0.6617 | 1.6886 | 17200 | 0.6738 | 0.0078 | 0.2092 | 0.7415 | 0.2092 |
| 0.6106 | 1.6984 | 17300 | 0.6207 | 0.0078 | 0.2308 | 0.7604 | 0.2308 |
| 0.5868 | 1.7082 | 17400 | 0.6408 | 0.0078 | 0.2206 | 0.7511 | 0.2206 |
| 0.6524 | 1.7180 | 17500 | 0.6412 | 0.0078 | 0.2190 | 0.7520 | 0.2190 |
| 0.6017 | 1.7279 | 17600 | 0.6477 | 0.0078 | 0.2097 | 0.7489 | 0.2097 |
| 0.6908 | 1.7377 | 17700 | 0.6321 | 0.0078 | 0.2239 | 0.7577 | 0.2239 |
| 0.6504 | 1.7475 | 17800 | 0.6324 | 0.0078 | 0.2174 | 0.7558 | 0.2174 |
| 0.6495 | 1.7573 | 17900 | 0.6287 | 0.0078 | 0.2231 | 0.7566 | 0.2231 |
| 0.6319 | 1.7671 | 18000 | 0.6461 | 0.0078 | 0.2033 | 0.7486 | 0.2033 |
| 0.635 | 1.7769 | 18100 | 0.6372 | 0.0078 | 0.2182 | 0.7539 | 0.2182 |
| 0.6593 | 1.7868 | 18200 | 0.6336 | 0.0078 | 0.2186 | 0.7520 | 0.2186 |
| 0.6831 | 1.7966 | 18300 | 0.6425 | 0.0078 | 0.2043 | 0.7496 | 0.2043 |
| 0.6132 | 1.8064 | 18400 | 0.6316 | 0.0078 | 0.2165 | 0.7538 | 0.2165 |
| 0.6062 | 1.8162 | 18500 | 0.6462 | 0.0078 | 0.2085 | 0.7486 | 0.2085 |
| 0.6509 | 1.8260 | 18600 | 0.6572 | 0.0078 | 0.2040 | 0.7456 | 0.2040 |
| 0.6316 | 1.8359 | 18700 | 0.6596 | 0.0078 | 0.2065 | 0.7458 | 0.2065 |
| 0.5951 | 1.8457 | 18800 | 0.6455 | 0.0078 | 0.2076 | 0.7484 | 0.2076 |
| 0.6158 | 1.8555 | 18900 | 0.6451 | 0.0078 | 0.2119 | 0.7504 | 0.2119 |
| 0.6505 | 1.8653 | 19000 | 0.6439 | 0.0078 | 0.2136 | 0.7497 | 0.2136 |
| 0.6603 | 1.8751 | 19100 | 0.6975 | 0.0078 | 0.1858 | 0.7327 | 0.1858 |
| 0.5653 | 1.8849 | 19200 | 0.6426 | 0.0078 | 0.2068 | 0.7499 | 0.2068 |
| 0.6986 | 1.8948 | 19300 | 0.6449 | 0.0078 | 0.2170 | 0.7501 | 0.2170 |
| 0.584 | 1.9046 | 19400 | 0.6191 | 0.0078 | 0.2309 | 0.7603 | 0.2309 |
| 0.6381 | 1.9144 | 19500 | 0.6268 | 0.0078 | 0.2216 | 0.7552 | 0.2216 |
| 0.6571 | 1.9242 | 19600 | 0.6502 | 0.0078 | 0.2093 | 0.7472 | 0.2093 |
| 0.6661 | 1.9340 | 19700 | 0.6701 | 0.0078 | 0.2025 | 0.7427 | 0.2025 |
| 0.6508 | 1.9438 | 19800 | 0.6511 | 0.0078 | 0.2051 | 0.7455 | 0.2051 |
| 0.6047 | 1.9537 | 19900 | 0.6455 | 0.0078 | 0.2157 | 0.7514 | 0.2157 |
| 0.6285 | 1.9635 | 20000 | 0.6457 | 0.0078 | 0.2126 | 0.7501 | 0.2126 |
| 0.5984 | 1.9733 | 20100 | 0.6321 | 0.0078 | 0.2221 | 0.7569 | 0.2221 |
| 0.6819 | 1.9831 | 20200 | 0.6396 | 0.0078 | 0.2151 | 0.7520 | 0.2151 |
| 0.6042 | 1.9929 | 20300 | 0.6535 | 0.0078 | 0.2086 | 0.7465 | 0.2086 |
| 0.6213 | 2.0027 | 20400 | 0.6622 | 0.0078 | 0.2034 | 0.7431 | 0.2034 |
| 0.6538 | 2.0126 | 20500 | 0.6816 | 0.0078 | 0.1918 | 0.7376 | 0.1918 |
| 0.6581 | 2.0224 | 20600 | 0.6562 | 0.0078 | 0.2112 | 0.7452 | 0.2112 |
| 0.5773 | 2.0322 | 20700 | 0.6345 | 0.0078 | 0.2195 | 0.7538 | 0.2195 |
| 0.5725 | 2.0420 | 20800 | 0.6363 | 0.0078 | 0.2246 | 0.7544 | 0.2246 |
| 0.6478 | 2.0518 | 20900 | 0.6491 | 0.0078 | 0.2097 | 0.7488 | 0.2097 |
| 0.6167 | 2.0617 | 21000 | 0.6468 | 0.0078 | 0.2150 | 0.7502 | 0.2150 |
| 0.7064 | 2.0715 | 21100 | 0.6870 | 0.0078 | 0.1984 | 0.7343 | 0.1984 |
| 0.7072 | 2.0813 | 21200 | 0.6708 | 0.0078 | 0.1979 | 0.7429 | 0.1979 |
| 0.7423 | 2.0911 | 21300 | 0.7345 | 0.0078 | 0.1780 | 0.7200 | 0.1780 |
| 0.7061 | 2.1009 | 21400 | 0.6791 | 0.0078 | 0.1977 | 0.7379 | 0.1977 |
| 0.6811 | 2.1107 | 21500 | 0.6606 | 0.0078 | 0.2062 | 0.7450 | 0.2062 |
| 0.6401 | 2.1206 | 21600 | 0.6603 | 0.0078 | 0.1963 | 0.7427 | 0.1963 |
| 0.8407 | 2.1304 | 21700 | 0.8006 | 0.0078 | 0.1447 | 0.6986 | 0.1447 |
| 0.7824 | 2.1402 | 21800 | 0.8081 | 0.0078 | 0.1544 | 0.6968 | 0.1544 |
| 0.7872 | 2.1500 | 21900 | 0.7969 | 0.0078 | 0.1548 | 0.6977 | 0.1548 |
| 0.6398 | 2.1598 | 22000 | 0.6618 | 0.0078 | 0.2066 | 0.7416 | 0.2066 |
| 0.7672 | 2.1696 | 22100 | 0.7849 | 0.0078 | 0.1595 | 0.7025 | 0.1595 |
| 0.629 | 2.1795 | 22200 | 0.6783 | 0.0078 | 0.1934 | 0.7369 | 0.1934 |
| 0.7005 | 2.1893 | 22300 | 0.6946 | 0.0078 | 0.1874 | 0.7308 | 0.1874 |
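Validation loss reaches its minimum at step 11200 and trends upward with more noise afterwards, so which checkpoint counts as "best" depends on the metric. The sketch below illustrates this with a handful of rows copied from the table; it is only an example of the selection logic, not the procedure used to produce this card.

```python
# A few rows copied from the table above: (step, validation_loss, move_accuracy).
ROWS = [
    (10000, 0.6101, 0.2338),
    (11200, 0.6000, 0.2445),
    (12200, 0.6087, 0.2450),
    (16800, 0.7134, 0.1862),
    (22300, 0.6946, 0.1874),
]

# Lower validation loss is better; higher move accuracy is better.
best_by_loss = min(ROWS, key=lambda row: row[1])
best_by_move_acc = max(ROWS, key=lambda row: row[2])

print("lowest validation loss at step", best_by_loss[0])
print("highest move accuracy at step", best_by_move_acc[0])
```

The two criteria pick different checkpoints here: step 11200 by validation loss and step 12200 by Move Accuracy.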
Framework versions
- PEFT 0.15.2
- Transformers 4.51.3
- Pytorch 2.6.0+cu124
- Datasets 3.5.0
- Tokenizers 0.21.1
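Because PEFT is listed among the framework versions, the published weights are presumably an adapter on top of the base model. A minimal loading sketch under that assumption follows; the repository id `donoway/xayysomw_20250703_033117` is taken from the model's hub page, and `load_finetuned_model` is a hypothetical helper name.

```python
def load_finetuned_model(
    adapter_id: str = "donoway/xayysomw_20250703_033117",  # assumed repo id
    base_id: str = "meta-llama/Llama-3.2-1B",
):
    """Load the base model and attach the fine-tuned PEFT adapter.

    Assumes the repository stores PEFT adapter weights rather than a
    fully merged model.
    """
    # Imported lazily so the function can be defined without the libraries.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base_model = AutoModelForCausalLM.from_pretrained(base_id)
    model = PeftModel.from_pretrained(base_model, adapter_id)
    return tokenizer, model.eval()
```

Calling `load_finetuned_model()` downloads both repositories, and the base model may require accepting its license on the hub first.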