Output mess

#11

by FlintyLemming - opened Dec 11, 2025

Dec 11, 2025

Output totally mess

using 4 H200 141GB
both docker and vllm cli

CUDA_VISIBLE_DEVICES=4,5,6,7 vllm serve /mnt/extend/models/llm/mistralai/Devstral-2-123B-Instruct-2512 --tool-call-parser mistral --enable-auto-tool-choice --tensor-parallel-size 2 --served-model-name devstral-2-123b-instruct

docker run -d --name vllm-devstral   --gpus '"device=4,5,6,7"'   -p 8000:8000   -v /mnt/extend/models/llm/mistralai/Devstral-2-123B-Instruct-2512:/model   mistralllm/vllm_devstral:latest   --model /model   --tool-call-parser mistral   --enable-auto-tool-choice   --tensor-parallel-size 4   --served-model-name devstral-2-123b-instruct

FlintyLemming changed discussion status to closed Dec 11, 2025

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment