chemlm-0.83m / deepspeed_config.json
sagawa's picture
Upload 8 files
31a2690 verified
{
"gradient_clipping": 0.0,
"steps_per_print": 100,
"train_batch_size": 4096,
"train_micro_batch_size_per_gpu": 2048,
"wall_clock_breakdown": false
}