llimba-3b-instruct-cpt / training_metrics.json
Initial release of llimba-3b-instruct-cpt
6d1df70
{
  "train_runtime": 19339.1514,
  "train_samples_per_second": 1.98,
  "train_steps_per_second": 0.124,
  "total_flos": 6.402336720194765e+17,
  "train_loss": 1.952849349083259
}
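
The figures above are easier to interpret once a few derived quantities are worked out. The sketch below is illustrative only and rests on assumptions the file does not state: that the keys follow the Hugging Face Trainer conventions (`train_runtime` in seconds, `total_flos` as an estimated operation count) and that `train_loss` is a mean cross-entropy in nats, so that its exponential can be read as a perplexity. The helper name `derive` and the output keys are hypothetical. Because the throughput values in the file are rounded, every derived number is approximate.

```python
import json
import math

# Values copied verbatim from training_metrics.json.
RAW = """
{
  "train_runtime": 19339.1514,
  "train_samples_per_second": 1.98,
  "train_steps_per_second": 0.124,
  "total_flos": 6.402336720194765e+17,
  "train_loss": 1.952849349083259
}
"""


def derive(metrics):
    """Compute rough derived quantities (hypothetical helper, not part of the repo)."""
    runtime_s = metrics["train_runtime"]  # assumed to be seconds
    samples_per_s = metrics["train_samples_per_second"]
    steps_per_s = metrics["train_steps_per_second"]
    return {
        "runtime_hours": runtime_s / 3600,
        # Rates are rounded in the file, so these products are estimates.
        "approx_samples": runtime_s * samples_per_s,
        "approx_steps": runtime_s * steps_per_s,
        # Samples consumed per optimizer step (effective batch size).
        "approx_samples_per_step": samples_per_s / steps_per_s,
        # Average throughput implied by the estimated operation count.
        "avg_flops_per_second": metrics["total_flos"] / runtime_s,
        # Only meaningful if train_loss is mean cross-entropy in nats.
        "perplexity": math.exp(metrics["train_loss"]),
    }


derived = derive(json.loads(RAW))
for name, value in derived.items():
    print(f"{name}: {value:.4g}")
```

Under these assumptions the run lasted a little over five hours, and the ratio of the two throughput figures suggests roughly 16 samples per optimizer step. That ratio says nothing about how the batch was split across devices or gradient-accumulation steps.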