joerowell commited on
Commit
be45e1e
·
verified ·
1 Parent(s): 30ca10b

Enable thinking by default in recommended vllm serve command

Browse files

Adds --default-chat-template-kwargs '{"enable_thinking": true}' so the local recipe matches the Poolside API behavior and the 'preserved thinking is recommended' guidance elsewhere in the card. Without this flag, the chat template defaults enable_thinking to false.

Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -140,7 +140,8 @@ VLLM_USE_DEEP_GEMM=0 vllm serve \
140
  --tool-call-parser poolside_v1 \
141
  --reasoning-parser poolside_v1 \
142
  --enable-auto-tool-choice \
143
- --served-model-name laguna
 
144
  ```
145
 
146
  See the [vLLM recipes page](https://recipes.vllm.ai/poolside/Laguna-XS.2) for additional deployment guidance.
 
140
  --tool-call-parser poolside_v1 \
141
  --reasoning-parser poolside_v1 \
142
  --enable-auto-tool-choice \
143
+ --served-model-name laguna \
144
+ --default-chat-template-kwargs '{"enable_thinking": true}'
145
  ```
146
 
147
  See the [vLLM recipes page](https://recipes.vllm.ai/poolside/Laguna-XS.2) for additional deployment guidance.