Spaces:

techfreakworm
/

LTX2.3-Studio

Running on Zero

App Files Files Community

techfreakworm commited on 18 days ago

Commit

d3e4302

unverified ·

1 Parent(s): 14fcab5

fix(spaces): bump @spaces.GPU duration 300s → 600s

Browse files

Lipsync hit GPU task aborted exactly at the 300s wall. The workflow runs:
init/load (~3 min on cold cache) + sampler 1 (~50s) + sampler 2 (~30s) +
VAE decode + ffmpeg audio-mux. Total ran 305s — killed mid-mux just as
the video was about to land on disk.

The duration param is a timeout, not a billing cap (HF bills actual
usage). Bumping to 600s (10 min) gives heavy modes — lipsync, future
keyframe and style transfer with bigger workflows — headroom on cold
caches without affecting lighter T2V calls (those still finish in ~30s
and release the GPU back to the pool immediately).

Files changed (1) hide show

backend.py +7 -4

backend.py CHANGED Viewed

@@ -65,9 +65,12 @@ def _identity(fn):
 # ZeroGPU's startup detector scans loaded modules for spaces.GPU-wrapped
 # functions. The decorator must be applied at module load time — runtime
-# wrapping inside a request handler isn't detected. Pro tier per-call cap is
-# 300s, so we use that ceiling and let modes finish whenever they finish.
-_GPU = spaces.GPU(duration=300) if (spaces is not None and _on_spaces()) else _identity
 @_GPU
@@ -359,7 +362,7 @@ class ComfyUILibraryBackend:
                 # ProgressBar class reads, but is the documented API.
                 comfy.utils.set_progress_bar_global_hook(_hook)
                 # _execute_workflow is module-level and decorated with
-                # @spaces.GPU(duration=300) on Spaces — that's what makes the
                 # heavy compute run on a borrowed H200. Off-Spaces it's a
                 # plain call. Returns the video path directly (computed
                 # inside the GPU context so the executor's history is fresh).

 # ZeroGPU's startup detector scans loaded modules for spaces.GPU-wrapped
 # functions. The decorator must be applied at module load time — runtime
+# wrapping inside a request handler isn't detected. `duration` is the per-call
+# timeout, NOT a billing cap (HF bills actual usage). Setting it generously
+# (10 min) so heavy modes like lipsync (audio encoder + extra LoRAs + VAE
+# decode + ffmpeg mux) don't hit the 300s wall mid-mux. Light modes return
+# in ~30-60s and free the GPU back into the pool.
+_GPU = spaces.GPU(duration=600) if (spaces is not None and _on_spaces()) else _identity
 @_GPU
                 # ProgressBar class reads, but is the documented API.
                 comfy.utils.set_progress_bar_global_hook(_hook)
                 # _execute_workflow is module-level and decorated with
+                # @spaces.GPU(duration=600) on Spaces — that's what makes the
                 # heavy compute run on a borrowed H200. Off-Spaces it's a
                 # plain call. Returns the video path directly (computed
                 # inside the GPU context so the executor's history is fresh).