mega-asr-mlx / mlx /llm-mixed8_4 /generation_config.json
Reza2kn's picture
Add mixed 8-bit attn + 4-bit MLP variant (92.2%)
b0f4547 verified
raw
history blame contribute delete
128 Bytes
{
"_from_model_config": true,
"eos_token_id": [
151643,
151645
],
"pad_token_id": 151643,
"do_sample": false
}