Getting gibberish output
curl -s http://172.20.30.201:9003/v1/chat/completions \
  -H 'Content-Type: application/json' \
  -d '{
    "model": "qwen35-27b-tq3-4s",
    "messages": [
      {"role": "user", "content": "Reply with exactly: Hello world"}
    ],
    "max_tokens": 64,
    "temperature": 0.6,
    "top_p": 0.95,
    "top_k": 20,
    "min_p": 0
  }'
{"choices":[{"finish_reason":"length","index":0,"message":{"role":"assistant","content":"β\n\nβ ,ζ\n\n,,β,,,,,,, η»θ³\n\n ,β,βββη» η²β泑βζ―ζ,οΌζ³‘ \n \nθΏ«,,βζζ 泑泑泑泑__ __η²ζη»η»ζ³‘η»ζ³‘η»θ³ζ³‘"}}],"created":1775569105,"model":"qwen35-27b-tq3-4s","system_fingerprint":"b1-1cfa891","object":"chat.completion","usage":{"completion_tokens":64,"prompt_tokens":18,"total_tokens":82,"prompt_tokens_details":{"cached_tokens":0}},"id":"chatcmpl-In15nuGLYL7abWOAtpP8kmp0hVMrszMo","timings":{"cache_n":0,"prompt_n":18,"prompt_ms":1140.862,"prompt_per_token_ms":63.38122222222223,"prompt_per_second":15.777543646821437,"predicted_n":64,"predicted_ms":2453.442,"predicted_per_token_ms":38.33503125,"predicted_per_second":26.085801090875595}}
--reasoning on --reasoning-budget 0 --temp 0.6 --top-k 20 --min-p 0 --repeat-penalty 1.0
Try the flags above, or whatever the README recommends.
Please pull the latest from main and try again. Thanks.
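For anyone retesting after updating, here is a rough sketch of a pass/fail check on the response, so the result does not have to be eyeballed. `check_reply` is a made-up helper name, not part of any tool, and the pattern match is deliberately crude: it only looks for an assistant `content` field that is exactly `Hello world`.

```shell
#!/bin/sh
# Hypothetical helper: reads a chat-completions JSON response on stdin and
# succeeds only if some "content" field is exactly "Hello world".
# This is a plain pattern match, not a real JSON parse.
check_reply() {
  grep -Eq '"content"[[:space:]]*:[[:space:]]*"Hello world"'
}

# Usage, with the endpoint and model name copied from the report above:
# curl -s http://172.20.30.201:9003/v1/chat/completions \
#   -H 'Content-Type: application/json' \
#   -d '{"model": "qwen35-27b-tq3-4s", "messages": [{"role": "user", "content": "Reply with exactly: Hello world"}], "max_tokens": 64}' \
#   | check_reply && echo "output looks sane" || echo "still gibberish"
```

A field such as `reasoning_content` will not trigger a false pass, since the pattern requires a quote directly before `content`.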