Update README.md
Browse files
README.md
CHANGED
|
@@ -236,18 +236,16 @@ middle"><strong>3.08/22.92</strong></td>
|
|
| 236 |
## 4. Deployment
|
| 237 |
|
| 238 |
> [!Note]
|
| 239 |
-
> You can access JoyAI-LLM Flash API on https://docs.jdcloud.com/cn/jdaip/chat
|
| 240 |
-
>
|
| 241 |
-
> Currently, JoyAI-LLM Flash is recommended to run on the following inference
|
| 242 |
-
> engines:
|
| 243 |
|
| 244 |
* vLLM
|
| 245 |
* SGLang
|
| 246 |
|
| 247 |
The minimum version requirement for `transformers` is `4.57.1`.
|
| 248 |
|
| 249 |
-
Deployment examples can be found in the [Model Deployment
|
| 250 |
-
|
| 251 |
|
| 252 |
|
| 253 |
|
|
|
|
| 236 |
## 4. Deployment
|
| 237 |
|
| 238 |
> [!Note]
|
| 239 |
+
> You can access JoyAI-LLM Flash API on https://docs.jdcloud.com/cn/jdaip/chat and we provide OpenAI/Anthropic-compatible API for you.
|
| 240 |
+
> Currently, JoyAI-LLM-Flash-FP8 is recommended to run on the following inference engines:
|
|
|
|
|
|
|
| 241 |
|
| 242 |
* vLLM
|
| 243 |
* SGLang
|
| 244 |
|
| 245 |
The minimum version requirement for `transformers` is `4.57.1`.
|
| 246 |
|
| 247 |
+
Deployment examples can be found in the [Model Deployment Guide](docs/deploy_guidance.md).
|
| 248 |
+
|
| 249 |
|
| 250 |
|
| 251 |
|