Quantizations
Collection
All GGUF quants that I have made so far, and demos too. • 6 items • Updated
This model was converted to GGUF format from unsloth/Qwen2-0.5B-Instruct using llama.cpp via the ggml.ai's GGUF-my-repo space.
Refer to the original model card for more details on the model.
llama-cli --hf-repo Fu01978/Qwen2-0.5B-Instruct-Q2_K-GGUF --hf-file qwen2-0.5b-instruct-q2_k.gguf -p "The meaning to life and the universe is"
llama-server --hf-repo Fu01978/Qwen2-0.5B-Instruct-Q2_K-GGUF --hf-file qwen2-0.5b-instruct-q2_k.gguf -c 2048
2-bit
Base model
unsloth/Qwen2-0.5B-Instruct