ronitraj commited on
Commit
3ed43f8
·
1 Parent(s): 8be4805

fix: add packaging dependency and update docker docs

Browse files
Files changed (3) hide show
  1. README.md +10 -3
  2. pyproject.toml +1 -0
  3. requirements.txt +1 -0
README.md CHANGED
@@ -213,12 +213,18 @@ docker build -t llmserve-env .
213
  docker run --rm -p 7860:7860 llmserve-env
214
  ```
215
 
 
 
 
 
 
 
216
  Then verify:
217
 
218
  - API: `http://localhost:7860/health`
219
  - OpenEnv UI: `http://localhost:7860/web`
220
 
221
- `server/Dockerfile` is kept only as a compatibility mirror. The repo-level `Dockerfile` is the one to use for local verification and submission hardening.
222
 
223
  ## Baseline Inference
224
 
@@ -305,8 +311,9 @@ Recommended deployment path:
305
  1. Push this repository to the Space.
306
  2. Use the root `Dockerfile`.
307
  3. Set the Space port to `7860`.
308
- 4. Add `OPENAI_API_KEY` as a secret only if you want the OpenAI baseline endpoint to run in the deployed Space.
309
- 5. After deployment, verify:
 
310
  - `/health`
311
  - `/tasks`
312
  - `/web`
 
213
  docker run --rm -p 7860:7860 llmserve-env
214
  ```
215
 
216
+ For a fully clean rebuild, use:
217
+
218
+ ```bash
219
+ docker build --no-cache -t llmserve-env .
220
+ ```
221
+
222
  Then verify:
223
 
224
  - API: `http://localhost:7860/health`
225
  - OpenEnv UI: `http://localhost:7860/web`
226
 
227
+ The root `Dockerfile` builds a CPU-only image and packages the tracked `weights/` directory into the container. That is the Dockerfile used for local verification and Hugging Face submission. `server/Dockerfile` is kept only as a compatibility mirror.
228
 
229
  ## Baseline Inference
230
 
 
311
  1. Push this repository to the Space.
312
  2. Use the root `Dockerfile`.
313
  3. Set the Space port to `7860`.
314
+ 4. Make sure the repository includes the `weights/` directory; the Docker image copies those model files at build time.
315
+ 5. Add `OPENAI_API_KEY` as a secret only if you want the OpenAI baseline endpoint to run in the deployed Space.
316
+ 6. After deployment, verify:
317
  - `/health`
318
  - `/tasks`
319
  - `/web`
pyproject.toml CHANGED
@@ -22,6 +22,7 @@ dependencies = [
22
  "pyarrow>=15.0,<20.0",
23
  "httpx>=0.27,<1.0",
24
  "gradio>=5.0,<7.0",
 
25
  "torch>=2.3,<3.0",
26
  ]
27
 
 
22
  "pyarrow>=15.0,<20.0",
23
  "httpx>=0.27,<1.0",
24
  "gradio>=5.0,<7.0",
25
+ "packaging>=24.0,<26.0",
26
  "torch>=2.3,<3.0",
27
  ]
28
 
requirements.txt CHANGED
@@ -11,6 +11,7 @@ numpy>=1.26,<3.0
11
  scipy>=1.12,<2.0
12
  pandas>=2.2,<3.0
13
  pyarrow>=15.0,<20.0
 
14
  torch>=2.3,<3.0
15
  gradio>=5.0,<7.0
16
 
 
11
  scipy>=1.12,<2.0
12
  pandas>=2.2,<3.0
13
  pyarrow>=15.0,<20.0
14
+ packaging>=24.0,<26.0
15
  torch>=2.3,<3.0
16
  gradio>=5.0,<7.0
17