Enable thinking by default in recommended vllm serve command
Browse filesAdds --default-chat-template-kwargs '{"enable_thinking": true}' so the local recipe matches the Poolside API behavior and the 'preserved thinking is recommended' guidance elsewhere in the card. Without this flag, the chat template defaults enable_thinking to false.
README.md
CHANGED
|
@@ -140,7 +140,8 @@ VLLM_USE_DEEP_GEMM=0 vllm serve \
|
|
| 140 |
--tool-call-parser poolside_v1 \
|
| 141 |
--reasoning-parser poolside_v1 \
|
| 142 |
--enable-auto-tool-choice \
|
| 143 |
-
--served-model-name laguna
|
|
|
|
| 144 |
```
|
| 145 |
|
| 146 |
See the [vLLM recipes page](https://recipes.vllm.ai/poolside/Laguna-XS.2) for additional deployment guidance.
|
|
|
|
| 140 |
--tool-call-parser poolside_v1 \
|
| 141 |
--reasoning-parser poolside_v1 \
|
| 142 |
--enable-auto-tool-choice \
|
| 143 |
+
--served-model-name laguna \
|
| 144 |
+
--default-chat-template-kwargs '{"enable_thinking": true}'
|
| 145 |
```
|
| 146 |
|
| 147 |
See the [vLLM recipes page](https://recipes.vllm.ai/poolside/Laguna-XS.2) for additional deployment guidance.
|