RedHatAI/Qwen3.5-122B-A10B-NVFP4

Tags: Safetensors, qwen3_5_moe, qwen, nvfp4, vllm, compressed-tensors, 8-bit precision

Any plans on NVFP4 quantization of smaller Qwen3.5 models (like 35B-A3B and 27B)?

#2 opened 28 days ago by GabrielaCats

Use Qwen2TokenizerFast tokenizer class for vllm support

👍 2
1
#1 opened about 1 month ago by romaai