Instructions to use froggeric/Qwen-Fixed-Chat-Templates with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use froggeric/Qwen-Fixed-Chat-Templates with MLX:
# Download the model from the Hub pip install huggingface_hub[hf_xet] huggingface-cli download --local-dir Qwen-Fixed-Chat-Templates froggeric/Qwen-Fixed-Chat-Templates
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- LM Studio
v16 - Reasoning despite setting --reasoning off in llama.cpp
am i doing something wrong or is this expected behaviour? it is having a lot of problems with tool calling, tested web search and browser navigation in hermes so far, with qwen3.6 35b. and i see very often the thinking blocks, although i turned thinking off by setting --reasoning off in llama.cpp command.
Edit: the thinking blocks dont appear at all with the built in template. using the UD-IQ4_NL-XL quant
Edit: the thinking blocks dont appear at all with the built in template. using the UD-IQ4_NL-XL quant
Solved in the final v16 release which I have now promoted to official release.
great! thanks for the response. i tried it and it looks better when reasoning is on, but when reasoning is off, it fails tool calls and just stops working with API call errors