Add documentation on how to use with vLLM to README.md

#7
by stelterlab - opened

Please add the necessary arguments for running this model with vLLM and SGLang. They seem to be:

    --tool-call-parser qwen3_coder \
    --enable-auto-tool-choice \
    --reasoning-parser qwen3 \
    --trust-remote-code 

for vLLM.

Hi @stelterlab, thanks for pointing this out. We will add the arguments for vLLM and SGLang serving.

Just one correction:
The reasoning parser should be deepseek_r1 or nemotron_v3, not qwen3.
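
For reference, the flags from the original post with the corrected reasoning parser could be assembled roughly as below. This is only a sketch: `MODEL_ID` and its default value are placeholders (the thread does not name the repository), and it assumes the remaining flags from the original post are otherwise right, which the reply does not explicitly confirm. The script prints the command instead of running it.

```shell
#!/usr/bin/env bash
# MODEL_ID is a hypothetical placeholder; substitute the actual model repository.
MODEL_ID="${MODEL_ID:-your-org/your-model}"

# Flags from the original post, with the reasoning parser changed per the correction.
VLLM_ARGS=(
  --tool-call-parser qwen3_coder
  --enable-auto-tool-choice
  --reasoning-parser deepseek_r1   # nemotron_v3 is the other option named in the reply
  --trust-remote-code
)

# Print the command for review; remove `echo` to actually start the server.
echo vllm serve "$MODEL_ID" "${VLLM_ARGS[@]}"
```

Printing first makes it easy to check the final argument list before launching. SGLang arguments are not covered here, since the thread does not list them.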
