
Corrected typo in TensorRT-LLM serving command

#8

Problem

The TRT-LLM serving command in the README uses --reasoning_parser nano_v3
(underscore), but the valid options listed by trtllm-serve --help are:

--reasoning_parser [deepseek-r1|qwen3|nano-v3]

Using nano_v3 causes an error when running the command.

Fix

Changed nano_v3 to nano-v3 to match the actual CLI option.
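For context, a corrected invocation would look like the sketch below. Only the --reasoning_parser value is part of this change; the model path comes from the Verified section, and the --tp_size value is an illustrative assumption matching the 4-GPU test setup, not something specified by this PR:

```shell
# Corrected serving command: the parser name uses a hyphen, not an underscore.
# --tp_size 4 is an assumed setting matching the 4x H100 verification run.
trtllm-serve nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 \
  --tp_size 4 \
  --reasoning_parser nano-v3   # was: nano_v3, which the CLI rejects
```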

Verified

Tested on TensorRT-LLM release:1.3.0rc5 with
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 on 4× H100-80GB.

