We advise you to clone [`llama.cpp`](https://github.com/ggerganov/llama.cpp) and install it following the official guide. Our instructions follow the latest version of llama.cpp.
In the following demonstration, we assume that you are running commands in the `llama.cpp` repository directory.

First, download the quantized GGUF files from the model repository on Hugging Face:

```shell
huggingface-cli download Qwen/Qwen3-Coder-Next-GGUF --include "Qwen3-Coder-Next-Q5_K_M/*"
```
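The Q5_K_M weights are split into multiple GGUF shards; llama.cpp loads the remaining shards automatically when pointed at the first one. As a quick sanity check (assuming the default download path above), you can list the downloaded files:

```shell
# List the downloaded shards; every *-000NN-of-000NN.gguf file should be present.
ls -lh ./Qwen3-Coder-Next-Q5_K_M/*.gguf
```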
Then run the model in interactive mode with `llama-cli`:

```shell
./llama-cli -m ./Qwen3-Coder-Next-Q5_K_M/Qwen3-Coder-Next-00001-of-00004.gguf --jinja -ngl 99 -fa on -sm row --temp 1.0 --top-k 40 --top-p 0.95 --min-p 0 -c 40960 -n 32768 --no-context-shift
```
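As a sketch not taken from the model card, the same GGUF can also be served over llama.cpp's OpenAI-compatible HTTP API with `llama-server` (built alongside `llama-cli`); the flags mirror the invocation above, and the port and prompt are illustrative assumptions:

```shell
# Serve the model over an OpenAI-compatible API on port 8080 (illustrative).
./llama-server -m ./Qwen3-Coder-Next-Q5_K_M/Qwen3-Coder-Next-00001-of-00004.gguf \
  --jinja -ngl 99 -fa on -c 40960 --port 8080

# In another terminal, query the chat completions endpoint.
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"messages": [{"role": "user", "content": "Write hello world in C."}]}'
```

Sampling parameters such as `--temp` and `--top-p` can also be set per request in the JSON body instead of on the server command line.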

## Processing Long Texts