| Property | Value |
|---|---|
| epoch | 13.33 |
| global_step | 105 |
| learning_rate | 0 |
| loss | 0.2632 |
| total_flos | 8,282,766,064,877,568 |
| train_loss | 1.585122755595616 |
| train_runtime | 2,810.2888 |
| train_samples_per_second | 10.76 |
| train_steps_per_second | 0.037 |
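
As a rough consistency check on the metrics above, the reported rates can be multiplied back out to estimate how many steps and samples the run covered. This is only a sketch: it assumes `train_runtime` is in seconds, the rates are rounded as shown, and the sample rate reflects samples actually processed. The `derive` helper and its output keys are hypothetical names, not part of any library.

```python
# Values copied from the reported training metrics.
metrics = {
    "epoch": 13.33,
    "global_step": 105,
    "train_runtime": 2810.2888,  # assumed to be seconds
    "train_samples_per_second": 10.76,
    "train_steps_per_second": 0.037,
}


def derive(m):
    """Estimate run-level quantities from the rounded rates (hypothetical helper)."""
    total_samples = m["train_samples_per_second"] * m["train_runtime"]
    return {
        # Should land close to global_step if the metrics are consistent.
        "steps_from_rate": m["train_steps_per_second"] * m["train_runtime"],
        # Approximate number of samples processed over the whole run.
        "total_samples_seen": total_samples,
        # Approximate effective batch size (samples per optimizer step).
        "samples_per_step": total_samples / m["global_step"],
        # Approximate dataset size, assuming the epoch count above.
        "samples_per_epoch": total_samples / m["epoch"],
    }


derived = derive(metrics)
for name, value in derived.items():
    print(f"{name}: {value:.1f}")
```

Because the inputs are rounded, the results are estimates only. The step count recovered from the rate should sit near the reported `global_step` of 105, and the samples-per-step figure gives a ballpark for the effective batch size rather than the configured value.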

wandb chart

Model size: 1B params (Safetensors), tensor type: F16