nanowhale-100m / generation_config.json
cmpatino HF Staff
Upload SmolDeepSeek-V4 100M SFT model (3000 steps on SmolTalk)
964e055 verified
226 Bytes
{
  "_from_model_config": true,
  "bos_token_id": 0,
  "eos_token_id": [
    1
  ],
  "output_attentions": false,
  "output_hidden_states": false,
  "pad_token_id": 1,
  "transformers_version": "5.6.2",
  "use_cache": false
}
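
The following is a minimal sketch of how the fields above could be inspected before generation, using only the Python standard library. The `CONFIG_TEXT` constant and the `normalize_eos` helper are hypothetical names introduced for illustration; they are not part of this repository or of the `transformers` API.

```python
import json

# Copy of the config shown above, embedded so the sketch is self-contained.
CONFIG_TEXT = """
{
  "_from_model_config": true,
  "bos_token_id": 0,
  "eos_token_id": [1],
  "output_attentions": false,
  "output_hidden_states": false,
  "pad_token_id": 1,
  "transformers_version": "5.6.2",
  "use_cache": false
}
"""


def normalize_eos(eos):
    """Hypothetical helper: return eos_token_id as a list (it may be an int or a list)."""
    return [eos] if isinstance(eos, int) else list(eos)


config = json.loads(CONFIG_TEXT)
eos_ids = normalize_eos(config["eos_token_id"])

# True when the padding token reuses an end-of-sequence id, as it does here.
pad_is_eos = config["pad_token_id"] in eos_ids

print("eos ids:", eos_ids)
print("pad shares an id with eos:", pad_is_eos)
print("use_cache default:", config["use_cache"])
```

Two things stand out in this config. Because `pad_token_id` equals the only `eos_token_id`, padding cannot be told apart from end-of-sequence by token id alone, so batched generation would likely need an explicit attention mask. And since `use_cache` is `false`, callers who want key/value caching during decoding would presumably have to enable it themselves at generation time.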