fix: add packaging dependency and update docker docs
- README.md +10 -3
- pyproject.toml +1 -0
- requirements.txt +1 -0
README.md
CHANGED
@@ -213,12 +213,18 @@ docker build -t llmserve-env .
 docker run --rm -p 7860:7860 llmserve-env
 ```
 
+For a fully clean rebuild, use:
+
+```bash
+docker build --no-cache -t llmserve-env .
+```
+
 Then verify:
 
 - API: `http://localhost:7860/health`
 - OpenEnv UI: `http://localhost:7860/web`
 
-`
+The root `Dockerfile` builds a CPU-only image and packages the tracked `weights/` directory into the container. That is the Dockerfile used for local verification and Hugging Face submission. `server/Dockerfile` is kept only as a compatibility mirror.
 
 ## Baseline Inference
 
@@ -305,8 +311,9 @@ Recommended deployment path:
 1. Push this repository to the Space.
 2. Use the root `Dockerfile`.
 3. Set the Space port to `7860`.
-4.
-5.
+4. Make sure the repository includes the `weights/` directory; the Docker image copies those model files at build time.
+5. Add `OPENAI_API_KEY` as a secret only if you want the OpenAI baseline endpoint to run in the deployed Space.
+6. After deployment, verify:
 - `/health`
 - `/tasks`
 - `/web`
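Reviewer note: the verification steps in the README hunks above could be scripted. The following is a minimal sketch, not part of this commit. It assumes the container is reachable at `http://localhost:7860` (the port mapping from the `docker run` line) and that an HTTP 200 response counts as healthy, which the README does not state explicitly. The names `fetch_status` and `check_endpoints` are hypothetical.

```python
from urllib.error import HTTPError, URLError
from urllib.request import urlopen

# Paths come from the README's "verify" list; the base URL is an assumption
# based on the `-p 7860:7860` port mapping.
BASE_URL = "http://localhost:7860"
PATHS = ("/health", "/tasks", "/web")


def fetch_status(url, timeout=5.0):
    """Return the HTTP status code for a GET request, or None if unreachable."""
    try:
        with urlopen(url, timeout=timeout) as response:
            return response.status
    except HTTPError as exc:
        # The server answered, but with a 4xx/5xx status.
        return exc.code
    except (URLError, OSError):
        # Connection refused, DNS failure, timeout, and similar.
        return None


def check_endpoints(base_url=BASE_URL, paths=PATHS, fetch=fetch_status):
    """Map each path to True when its status is 200 (assumed success criterion)."""
    root = base_url.rstrip("/")
    return {path: fetch(root + path) == 200 for path in paths}
```

Calling `check_endpoints()` after `docker run` returns a dictionary such as `{"/health": True, ...}`. The `fetch` parameter is injectable so the check can be exercised without a running container, and passing a different `base_url` would cover a deployed Space.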
pyproject.toml
CHANGED
@@ -22,6 +22,7 @@ dependencies = [
 "pyarrow>=15.0,<20.0",
 "httpx>=0.27,<1.0",
 "gradio>=5.0,<7.0",
+"packaging>=24.0,<26.0",
 "torch>=2.3,<3.0",
 ]
 
requirements.txt
CHANGED
@@ -11,6 +11,7 @@ numpy>=1.26,<3.0
 scipy>=1.12,<2.0
 pandas>=2.2,<3.0
 pyarrow>=15.0,<20.0
+packaging>=24.0,<26.0
 torch>=2.3,<3.0
 gradio>=5.0,<7.0
 