刘鑫 committed on
Commit 364d86e · 1 Parent(s): 0203b68

fix: revert concurrency limit to 1 — nanovllm event loop is not thread-safe

nanovllm-voxcpm uses a single asyncio event loop internally
(loop.run_until_complete). With concurrency > 1 multiple Gradio
worker threads hit the same loop simultaneously, causing
"RuntimeError: This event loop is already running" on every request.
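
The failure mode described above can be reproduced with the standard library alone. The following is a minimal sketch, not nanovllm-voxcpm code: `fake_generate`, `unsafe_call`, `safe_call`, and `loop_lock` are hypothetical names, and the shared `loop` only stands in for the backend's internal loop. It shows that a second thread calling `run_until_complete` on a loop that is already running gets the `RuntimeError`, and that serializing callers (which is what a concurrency limit of 1 amounts to) avoids it.

```python
import asyncio
import threading

# Stand-in for the backend's single internal loop (hypothetical, not nanovllm-voxcpm code).
loop = asyncio.new_event_loop()
loop_lock = threading.Lock()  # hypothetical guard that serializes access to the loop


async def fake_generate(text, delay, started=None):
    """Pretend inference call; sets `started` once the loop is running it."""
    if started is not None:
        started.set()
    await asyncio.sleep(delay)
    return text.upper()


def unsafe_call(text, delay, started=None):
    # Mirrors the pattern named in the commit message: run_until_complete on a shared loop.
    return loop.run_until_complete(fake_generate(text, delay, started))


def safe_call(text, delay=0.01):
    # Only one thread at a time may drive the loop.
    with loop_lock:
        return loop.run_until_complete(fake_generate(text, delay))


def demo_collision():
    """Return the error message raised when a second thread enters the busy loop."""
    started = threading.Event()
    worker = threading.Thread(target=unsafe_call, args=("first", 0.5, started))
    worker.start()
    started.wait()  # the worker thread is now running the loop
    coro = fake_generate("second", 0.0)
    try:
        loop.run_until_complete(coro)
        message = ""
    except RuntimeError as exc:
        message = str(exc)
        coro.close()  # never scheduled; close it to avoid a "never awaited" warning
    worker.join()
    return message


def demo_serialized(texts=("a", "b", "c")):
    """Run several threads through the lock-guarded path and collect the results."""
    results = []

    def handle(text):
        results.append(safe_call(text))

    threads = [threading.Thread(target=handle, args=(t,)) for t in texts]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return sorted(results)


if __name__ == "__main__":
    print(demo_collision())
    print(demo_serialized())
```

A lock inside the application would be one alternative to limiting Gradio's concurrency, but it yields the same effective throughput; this commit takes the simpler route of keeping the limit at 1.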

Made-with: Cursor

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -61,5 +61,5 @@ Recommended environment variables:
  - `NANOVLLM_TEMPERATURE`: defaults to `1.0`
  - `REQUEST_LOG_DIR`: optional persistent request log directory. Defaults to `/data/logs` when `/data` exists
  - `GRADIO_QUEUE_MAX_SIZE`: defaults to `10`
- - `GRADIO_DEFAULT_CONCURRENCY_LIMIT`: defaults to `1`
+ - `GRADIO_DEFAULT_CONCURRENCY_LIMIT`: defaults to `1` (nanovllm-voxcpm event loop is not thread-safe; do NOT increase)
  - `GRADIO_SSR_MODE`: defaults to `false`
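
As an illustration of how the two queue-related variables above might be consumed, here is a minimal sketch. It is not taken from this repository: `read_queue_settings` is a hypothetical helper, and rejecting values other than 1 is an assumption made here to enforce the "do NOT increase" note. The returned keys match the `max_size` and `default_concurrency_limit` parameters of Gradio's `Blocks.queue()` (assuming Gradio 4 or later).

```python
import os


def read_queue_settings(env=None):
    """Build queue keyword arguments from environment variables.

    Hypothetical helper: defaults follow the README (queue size 10, concurrency 1).
    """
    env = os.environ if env is None else env
    max_size = int(env.get("GRADIO_QUEUE_MAX_SIZE", "10"))
    limit = int(env.get("GRADIO_DEFAULT_CONCURRENCY_LIMIT", "1"))
    if limit != 1:
        # Assumption of this sketch: refuse values other than 1, since concurrent
        # worker threads would share the backend's single event loop.
        raise ValueError(
            "GRADIO_DEFAULT_CONCURRENCY_LIMIT must stay at 1; "
            "the nanovllm-voxcpm event loop is not thread-safe"
        )
    return {"max_size": max_size, "default_concurrency_limit": limit}


# In a Gradio app the result could be passed along as:
#     demo.queue(**read_queue_settings())
```

Failing at startup is one design choice; silently clamping the value to 1 would also work but hides the misconfiguration from whoever set it.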