--- license: mit base_model: HiDream-ai/HiDream-O1-Image-Dev tags: - mlx - mlx-vlm - hidream - text-to-image - apple-silicon - quantized - q6 language: - en pipeline_tag: text-to-image library_name: mlx inference: false authors: - Mrbizarro --- # HiDream-O1-Image-Dev โ€” MLX Q6 (Apple Silicon) > Ported by **[Mrbizarro](https://huggingface.co/Mrbizarro)** ยท MIT licensed ยท published to mlx-community ## ๐ŸŽ›๏ธ Run it one-click in **[Phosphene](https://github.com/mrbizarro/phosphene)** Phosphene is a free local generative-video panel for Apple Silicon. HiDream is wired into its Image Studio. **[Install Pinokio](https://pinokio.computer)**, then in Pinokio install [Phosphene](https://github.com/mrbizarro/phosphene). Note: Phosphene's HiDream integration uses BF16 by default since **edit requires BF16** โ€” this Q6 repo is for text-to-image-only workflows on RAM-constrained machines. --- A 6-bit quantized MLX port of [HiDream-ai/HiDream-O1-Image-Dev](https://huggingface.co/HiDream-ai/HiDream-O1-Image-Dev). **โš  Q6 does NOT support edit / multi-ref.** Per-group dequantization noise compounds against reference-image features in attention and produces degenerate output. For edit / multi-reference workflows use the [BF16 sibling repo](https://huggingface.co/mlx-community/HiDream-O1-Image-Dev-mlx-bf16) instead. ## Sibling repos - ๐ŸŸข **[BF16 (full precision)](https://huggingface.co/mlx-community/HiDream-O1-Image-Dev-mlx-bf16)** โ€” 17.5 GB, ~16 GB RAM, clean across all dimensions. **Use this when in doubt.** - ๐ŸŸก **Q6 (this repo)** โ€” 8 GB, ~8.5 GB RAM, fast. **Best at square 2048ร—2048 or 1024ร—1024.** Visible 32-pixel patch grid in flat regions at non-square dims. - ๐ŸŸก [Q8](https://huggingface.co/mlx-community/HiDream-O1-Image-Dev-mlx-q8) โ€” 10 GB, ~11.5 GB RAM, same artifact behaviour as Q6 at non-square dims. ## When to use Q6 - โœ… **Square 1024ร—1024 or 2048ร—2048** โ€” clean output, half the time of BF16 - โœ… **RAM-constrained** โ€” fits 16 GB Macs alongside other apps - โŒ **Non-square dims (1440ร—2560, 3104ร—1312, etc)** โ€” visible 32-pixel patch grid in skies, walls, water โ†’ use BF16 ## What's in this repo - `model.safetensors` โ€” Q6 quantized backbone (8 GB) - `extras/custom_heads.safetensors` โ€” diffusion-side heads (75 MB, BF16) - `config.json` (with `quantization: {bits: 6, group_size: 64}` so mlx-vlm wraps `Linear โ†’ QuantizedLinear` correctly) - Tokenizer + processor configs ## Code The inference scripts are **not** in this repo โ€” they live in the [BF16 sibling repo](https://huggingface.co/mlx-community/HiDream-O1-Image-Dev-mlx-bf16) under `scripts/hidream_o1/`. Clone that for code, this for weights only. ## Quick start ```bash # Get the code hf download mlx-community/HiDream-O1-Image-Dev-mlx-bf16 --local-dir hidream-o1-mlx \ --include "scripts/*" --include "*.md" --include "*.txt" --include "*.gitattributes" cd hidream-o1-mlx uv venv --python 3.11 && uv pip install -r requirements.txt # Get the Q6 weights hf download mlx-community/HiDream-O1-Image-Dev-mlx-q6 --local-dir mlx_models/hidream-o1-dev-q6 # Run (square dims only for clean output) .venv/bin/python scripts/hidream_o1/generate_hidream_o1_mlx.py \ --model-path mlx_models/hidream-o1-dev-q6 \ --prompt "your prompt here" \ --width 2048 --height 2048 \ --output out.png ``` ## Performance | Resolution | Per step | Total (28 steps) | Peak RAM | Quality | |---|---|---|---|---| | 1024ร—1024 | 1.30 s | 36 s | 8.5 GB | โœ… clean | | 2048ร—2048 | 5.51 s | 154 s | 9 GB | โœ… clean | | 1440ร—2560 (non-square) | 4.50 s | 127 s | 8.5 GB | โš  patch grid visible | ## License MIT โ€” see the [BF16 repo](https://huggingface.co/mlx-community/HiDream-O1-Image-Dev-mlx-bf16) for the full LICENSE file and acknowledgements.