techfreakworm commited on
Commit
c53ac67
·
unverified ·
1 Parent(s): 099a5f3

fix(spaces): drop 123 GB of unused preload (storage cap was 150 GB)

Browse files

HF evicted the workload at first deploy with "storage limit exceeded
(150G)" — total preload was ~234 GB. Trimming to fit:

- Drop ltx-2.3-22b-dev.safetensors and ltx-2.3-22b-distilled.safetensors
from Lightricks/LTX-2.3 (84 GB combined). Neither is referenced by
any workflow JSON we ship; the active transformer path uses the Kijai
bf16 variant.
- Drop the unsloth/LTX-2.3-GGUF entry entirely (39 GB). Only used by a
hypothetical low-VRAM preset that we don't currently route.

New total: ~111 GB. Marks future_improvements items 1 and 2 done.

Files changed (2) hide show
  1. README.md +1 -2
  2. docs/future_improvements.md +11 -12
README.md CHANGED
@@ -16,10 +16,9 @@ preload_from_hub:
16
  - Lightricks/LTX-2-19b-LoRA-Camera-Control-Jib-Down ltx-2-19b-lora-camera-control-jib-down.safetensors
17
  - Lightricks/LTX-2-19b-LoRA-Camera-Control-Jib-Up ltx-2-19b-lora-camera-control-jib-up.safetensors
18
  - Lightricks/LTX-2-19b-LoRA-Camera-Control-Static ltx-2-19b-lora-camera-control-static.safetensors
19
- - Lightricks/LTX-2.3 ltx-2.3-22b-dev.safetensors,ltx-2.3-22b-distilled-lora-384.safetensors,ltx-2.3-22b-distilled.safetensors,ltx-2.3-spatial-upscaler-x2-1.0.safetensors
20
  - Lightricks/LTX-2.3-22b-IC-LoRA-Union-Control ltx-2.3-22b-ic-lora-union-control-ref0.5.safetensors
21
  - google/gemma-3-12b-it-qat-q4_0-unquantized gemma-3-12b-it/model-00001-of-00005.safetensors,gemma-3-12b-it/model-00002-of-00005.safetensors,gemma-3-12b-it/model-00003-of-00005.safetensors,gemma-3-12b-it/model-00004-of-00005.safetensors,gemma-3-12b-it/model-00005-of-00005.safetensors,gemma-3-12b-it/model.safetensors.index.json,gemma-3-12b-it/preprocessor_config.json,gemma-3-12b-it/tokenizer.model
22
- - unsloth/LTX-2.3-GGUF ltx-2.3-22b-dev-BF16.gguf
23
  ---
24
 
25
  # LTX 2.3 All-in-One Video Generator
 
16
  - Lightricks/LTX-2-19b-LoRA-Camera-Control-Jib-Down ltx-2-19b-lora-camera-control-jib-down.safetensors
17
  - Lightricks/LTX-2-19b-LoRA-Camera-Control-Jib-Up ltx-2-19b-lora-camera-control-jib-up.safetensors
18
  - Lightricks/LTX-2-19b-LoRA-Camera-Control-Static ltx-2-19b-lora-camera-control-static.safetensors
19
+ - Lightricks/LTX-2.3 ltx-2.3-22b-distilled-lora-384.safetensors,ltx-2.3-spatial-upscaler-x2-1.0.safetensors
20
  - Lightricks/LTX-2.3-22b-IC-LoRA-Union-Control ltx-2.3-22b-ic-lora-union-control-ref0.5.safetensors
21
  - google/gemma-3-12b-it-qat-q4_0-unquantized gemma-3-12b-it/model-00001-of-00005.safetensors,gemma-3-12b-it/model-00002-of-00005.safetensors,gemma-3-12b-it/model-00003-of-00005.safetensors,gemma-3-12b-it/model-00004-of-00005.safetensors,gemma-3-12b-it/model-00005-of-00005.safetensors,gemma-3-12b-it/model.safetensors.index.json,gemma-3-12b-it/preprocessor_config.json,gemma-3-12b-it/tokenizer.model
 
22
  ---
23
 
24
  # LTX 2.3 All-in-One Video Generator
docs/future_improvements.md CHANGED
@@ -6,25 +6,24 @@ not commitment.
6
 
7
  ## Spaces / preload
8
 
9
- ### 1. Stop preloading models that aren't referenced by any workflow
10
 
11
- Audit on 2026-05-02 (`tools/audit-models` style script) showed two `Lightricks/LTX-2.3`
12
- files in `preload_from_hub` that aren't actually referenced by any workflow JSON
13
- we ship:
14
 
15
  - `ltx-2.3-22b-dev.safetensors` (~42 GB)
16
  - `ltx-2.3-22b-distilled.safetensors` (~42 GB)
17
 
18
- The active path uses `Kijai/LTX2.3_comfy diffusion_models/ltx-2.3-22b-dev_transformer_only_bf16.safetensors`.
19
- Removing both saves ~84 GB of preload bandwidth/storage. Risk: if a future
20
- workflow update reintroduces the Lightricks-side filenames, lazy download
21
- takes over (slow first inference) acceptable for the tradeoff.
22
 
23
- ### 2. Drop `unsloth/LTX-2.3-GGUF` from preload (~39 GB)
24
 
25
- The GGUF transformer is the low-VRAM alternative. ZeroGPU H200 has 70 GB so
26
- the BF16 transformer always fits. Lazy-load when a future "Low VRAM" preset
27
- actually wires the GGUF path.
28
 
29
  ### 3. Drop the `Lightricks/LTX-2-19b-LoRA-Camera-Control-Static/Jib-Up/Jib-Down` preload
30
 
 
6
 
7
  ## Spaces / preload
8
 
9
+ ### ~~1. Stop preloading models that aren't referenced by any workflow~~ — DONE 2026-05-02
10
 
11
+ Audit on 2026-05-02 showed two `Lightricks/LTX-2.3` files in `preload_from_hub`
12
+ that aren't actually referenced by any workflow JSON we ship:
 
13
 
14
  - `ltx-2.3-22b-dev.safetensors` (~42 GB)
15
  - `ltx-2.3-22b-distilled.safetensors` (~42 GB)
16
 
17
+ The active path uses `Kijai/LTX2.3_comfy ltx-2.3-22b-dev_transformer_only_bf16.safetensors`.
18
+ Removed both ~84 GB saved. Forced by HF eviction with `storage limit
19
+ exceeded (150G)` when total preload was ~234 GB. Risk: if a future workflow
20
+ update reintroduces the Lightricks-side filenames, lazy download takes over.
21
 
22
+ ### ~~2. Drop `unsloth/LTX-2.3-GGUF` from preload (~39 GB)~~ — DONE 2026-05-02
23
 
24
+ Removed alongside (1). GGUF transformer is the low-VRAM alternative; ZeroGPU
25
+ H200 has 70 GB so the BF16 transformer always fits. Lazy-loads on first use
26
+ of any preset that wires the GGUF path.
27
 
28
  ### 3. Drop the `Lightricks/LTX-2-19b-LoRA-Camera-Control-Static/Jib-Up/Jib-Down` preload
29