Spaces:
Running on A10G
刘鑫 committed on
Commit · 364d86e
1 Parent(s): 0203b68
fix: revert concurrency limit to 1 — nanovllm event loop is not thread-safe
nanovllm-voxcpm uses a single asyncio event loop internally
(loop.run_until_complete). With concurrency > 1, multiple Gradio
worker threads hit the same loop simultaneously, causing
"RuntimeError: This event loop is already running" on every request.
Made-with: Cursor
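
The failure mode described in the commit message can be reproduced with the standard library alone. The sketch below is not nanovllm-voxcpm code: `reproduce_shared_loop_error` and `slow_job` are hypothetical names, and a plain thread stands in for a Gradio worker thread. It only shows what `run_until_complete` does when a second thread calls it on a loop that is already running.

```python
import asyncio
import threading


def reproduce_shared_loop_error() -> list[str]:
    """Call run_until_complete on one shared loop from two threads at once."""
    loop = asyncio.new_event_loop()
    started = threading.Event()   # set once the loop is running in the worker
    release = threading.Event()   # set to let the worker's job finish
    errors: list[str] = []

    async def slow_job() -> None:
        started.set()
        # Keep the loop busy until the main thread releases it.
        while not release.is_set():
            await asyncio.sleep(0.01)

    # First "request": occupies the shared loop from a worker thread.
    worker = threading.Thread(target=lambda: loop.run_until_complete(slow_job()))
    worker.start()
    started.wait()

    # Second "request": arrives on another thread while the loop is running.
    coro = slow_job()
    try:
        loop.run_until_complete(coro)
    except RuntimeError as exc:
        errors.append(str(exc))
        coro.close()  # never scheduled, so close it to avoid a warning
    finally:
        release.set()
        worker.join()
        loop.close()
    return errors


if __name__ == "__main__":
    print(reproduce_shared_loop_error())
```

With a concurrency limit of 1, only one request reaches the loop at a time, so the second `run_until_complete` call never overlaps with the first.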
README.md
CHANGED
@@ -61,5 +61,5 @@ Recommended environment variables:
 - `NANOVLLM_TEMPERATURE`: defaults to `1.0`
 - `REQUEST_LOG_DIR`: optional persistent request log directory. Defaults to `/data/logs` when `/data` exists
 - `GRADIO_QUEUE_MAX_SIZE`: defaults to `10`
-- `GRADIO_DEFAULT_CONCURRENCY_LIMIT`: defaults to `1`
+- `GRADIO_DEFAULT_CONCURRENCY_LIMIT`: defaults to `1` (nanovllm-voxcpm event loop is not thread-safe; do NOT increase)
 - `GRADIO_SSR_MODE`: defaults to `false`
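
As a rough illustration of how the two queue-related variables in this README hunk might be consumed, here is a minimal sketch. The application code is not part of this diff, so `read_queue_settings` is a hypothetical helper. Only the variable names and their defaults (`10` and `1`) come from the README.

```python
import os
import warnings
from typing import Mapping


def read_queue_settings(env: Mapping[str, str] = os.environ) -> dict:
    """Read queue settings from the environment, using the README defaults."""
    max_size = int(env.get("GRADIO_QUEUE_MAX_SIZE", "10"))
    limit = int(env.get("GRADIO_DEFAULT_CONCURRENCY_LIMIT", "1"))
    if limit > 1:
        # The README warns that the nanovllm-voxcpm event loop is not
        # thread-safe, so flag any value above 1 instead of silently using it.
        warnings.warn(
            "GRADIO_DEFAULT_CONCURRENCY_LIMIT > 1 may trigger "
            "'This event loop is already running'",
            RuntimeWarning,
        )
    return {"max_size": max_size, "default_concurrency_limit": limit}


if __name__ == "__main__":
    print(read_queue_settings())
```

Assuming a Gradio 4 or later `Blocks` app named `demo` (not shown in this commit), the resulting dictionary could be passed along as `demo.queue(**read_queue_settings())`.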