Update README.md
Browse files
README.md
CHANGED
|
@@ -20,7 +20,7 @@ This model was obtained by using the following branch with LLM Compressor: https
|
|
| 20 |
|
| 21 |
This model was deployed using the following branch with vLLM: https://github.com/vllm-project/vllm/pull/41276
|
| 22 |
```bash
|
| 23 |
-
|
| 24 |
```
|
| 25 |
|
| 26 |
## Evaluation
|
|
|
|
| 20 |
|
| 21 |
This model was deployed using the following branch with vLLM: https://github.com/vllm-project/vllm/pull/41276
|
| 22 |
```bash
|
| 23 |
+
vllm serve RedHatAI/DeepSeek-V4-Flash-NVFP4-FP8 --tensor-parallel-size 4 --port 8089 --kv_cache_dtype="fp8"
|
| 24 |
```
|
| 25 |
|
| 26 |
## Evaluation
|