kylesayrs commited on
Commit
9ff4a84
·
verified ·
1 Parent(s): 6c21b8f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -20,7 +20,7 @@ This model was obtained by using the following branch with LLM Compressor: https
20
 
21
  This model was deployed using the following branch with vLLM: https://github.com/vllm-project/vllm/pull/41276
22
  ```bash
23
- VLLM_NVFP4_GEMM_BACKEND=marlin vllm serve RedHatAI/DeepSeek-V4-Flash-NVFP4-FP8 --tensor-parallel-size 4 --port 8089 --kv_cache_dtype="fp8"
24
  ```
25
 
26
  ## Evaluation
 
20
 
21
  This model was deployed using the following branch with vLLM: https://github.com/vllm-project/vllm/pull/41276
22
  ```bash
23
+ vllm serve RedHatAI/DeepSeek-V4-Flash-NVFP4-FP8 --tensor-parallel-size 4 --port 8089 --kv_cache_dtype="fp8"
24
  ```
25
 
26
  ## Evaluation