Posting these Qwen-14B-Chat quantized models in GGUF format for use with llama.cpp due to a user request.
But, having used an importance matrix derived from English-only training data in the quantization, I have no idea how these models will perform in Chinese.
- Downloads last month
- 70
Hardware compatibility
Log In to add your hardware
We're not able to determine the quantization variants.
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support