fix chat template to avoid empty historical `<think>` blocks
1
#3 opened 14 days ago
by
latent-variable
vLLM does not need --quantization moe_wna16
#2 opened about 1 month ago
by
ticoneva
Does this version still have the issue of extra spaces in Chinese environments?
1
#1 opened about 2 months ago
by
xxhf