Update README — mirror is now bf16 (~15.7 GB, half the fp32 footprint)
Browse files
README.md
CHANGED
|
@@ -69,8 +69,8 @@ Generation knobs exposed: prompt, negative prompt, duration, steps, CFG scale, A
|
|
| 69 |
## Format
|
| 70 |
|
| 71 |
- **All weights are `safetensors`.** No `.pt` / `.ckpt` / `.bin` in this mirror.
|
| 72 |
-
- Mirror is **
|
| 73 |
-
- Approximate disk sizes per subdir: small variants ~
|
| 74 |
|
| 75 |
## Usage
|
| 76 |
|
|
|
|
| 69 |
## Format
|
| 70 |
|
| 71 |
- **All weights are `safetensors`.** No `.pt` / `.ckpt` / `.bin` in this mirror.
|
| 72 |
+
- Mirror is **bf16** — re-saved via `safetensors.torch.save_model` (preserves shared RotaryEmbedding buffers that bare `save_file` would corrupt). Bytewise this halves disk size vs the fp32 upstream. The MAESTRO runner upcasts to fp32 transiently during `load_state_dict` then casts to fp16 (`model_half=True`) for inference — runtime VRAM is unchanged from the fp32 mirror, but disk + I/O + initial safetensors-read CPU spike are all halved.
|
| 73 |
+
- Approximate disk sizes per subdir: small variants ~1.14 GB each, medium variants ~4.61 GB each, SAME-S ~0.22 GB, SAME-L ~1.70 GB. Total mirror footprint ≈ 15.7 GB.
|
| 74 |
|
| 75 |
## Usage
|
| 76 |
|