This model is a fine-tuned version of Qwen/Qwen2.5-0.5B-Instruct on the Toys_and_Games_train dataset. At the final evaluation (step 320, epoch 10.0) the validation loss is 1.6463.
Training results, evaluated every 16 steps over 10 epochs (32 steps per epoch, 320 steps in total):
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 4.7233 | 0.5039 | 16 | 6.3760 |
| 2.8934 | 1.0 | 32 | 3.5466 |
| 2.111 | 1.5039 | 48 | 3.0951 |
| 1.658 | 2.0 | 64 | 2.9089 |
| 1.261 | 2.5039 | 80 | 2.7283 |
| 1.1549 | 3.0 | 96 | 2.5239 |
| 0.9521 | 3.5039 | 112 | 2.3141 |
| 0.9135 | 4.0 | 128 | 2.1239 |
| 0.8711 | 4.5039 | 144 | 1.9950 |
| 0.821 | 5.0 | 160 | 1.8966 |
| 0.7841 | 5.5039 | 176 | 1.8238 |
| 0.7771 | 6.0 | 192 | 1.7719 |
| 0.7577 | 6.5039 | 208 | 1.7509 |
| 0.7521 | 7.0 | 224 | 1.7131 |
| 0.7261 | 7.5039 | 240 | 1.6915 |
| 0.7488 | 8.0 | 256 | 1.6656 |
| 0.7328 | 8.5039 | 272 | 1.6585 |
| 0.7185 | 9.0 | 288 | 1.6499 |
| 0.7286 | 9.5039 | 304 | 1.6461 |
| 0.6968 | 10.0 | 320 | 1.6463 |
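
The table can be read more easily with a little arithmetic. The following is a minimal sketch, not part of the original training code: it copies the rows above into a list and reports where validation loss is lowest and how it changes between evaluations. The helper names `best_checkpoint` and `val_deltas` are hypothetical and introduced only for illustration.

```python
# Rows copied from the results table: (step, training loss, validation loss).
ROWS = [
    (16, 4.7233, 6.3760), (32, 2.8934, 3.5466), (48, 2.111, 3.0951),
    (64, 1.658, 2.9089), (80, 1.261, 2.7283), (96, 1.1549, 2.5239),
    (112, 0.9521, 2.3141), (128, 0.9135, 2.1239), (144, 0.8711, 1.9950),
    (160, 0.821, 1.8966), (176, 0.7841, 1.8238), (192, 0.7771, 1.7719),
    (208, 0.7577, 1.7509), (224, 0.7521, 1.7131), (240, 0.7261, 1.6915),
    (256, 0.7488, 1.6656), (272, 0.7328, 1.6585), (288, 0.7185, 1.6499),
    (304, 0.7286, 1.6461), (320, 0.6968, 1.6463),
]


def best_checkpoint(rows):
    """Return (step, validation loss) for the row with the lowest validation loss."""
    step, _, val = min(rows, key=lambda r: r[2])
    return step, val


def val_deltas(rows):
    """Return (step, change in validation loss since the previous evaluation)."""
    return [(b[0], round(b[2] - a[2], 4)) for a, b in zip(rows, rows[1:])]


if __name__ == "__main__":
    step, val = best_checkpoint(ROWS)
    print(f"lowest validation loss: {val} at step {step}")
    for s, d in val_deltas(ROWS)[-4:]:
        print(f"step {s}: validation loss changed by {d:+.4f}")
```

On these rows the lowest validation loss is 1.6461 at step 304. The final evaluation at step 320 is marginally higher at 1.6463, which suggests that validation loss had flattened out by the last epoch.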