DeepSeek-V4-Flash-W4A16-FP8 / tokenizer_config.json

Commit History

Phase 3b: AWQ-W4A16 quantization (FP8_BLOCK attn + W4A16 routed experts)
2e7ef6a
verified

pastapaul commited on