
canada-quant/DeepSeek-V4-Flash-W4A16-FP8-MTP

Text Generation

compressed-tensors

speculative-decoding

mixture-of-experts

DeepSeek-V4-Flash-W4A16-FP8-MTP / generation_config.json

Commit History

Phase 2 full GPTQ + BF16 MTP: 89.1% MTP acceptance

e910552
verified

pastapaul committed 3 days ago