chemlm-0.06m / deepspeed_config.json
sagawa's picture
Upload 8 files
f8e66ba verified
{
"gradient_clipping": 0.0,
"steps_per_print": 100,
"train_batch_size": 4096,
"train_micro_batch_size_per_gpu": 4096,
"wall_clock_breakdown": false
}