Qi Cai commited on
Commit
3bfc867
Β·
1 Parent(s): 85dbe17

update link

Browse files
Files changed (1) hide show
  1. README.md +13 -2
README.md CHANGED
@@ -11,7 +11,7 @@ HiDream-O1-Image is a natively unified image generative foundation model built o
11
  - πŸš€ **May 14, 2026:** We open-sourced [**HiDream-O1-Image-Dev-2604**](https://huggingface.co/HiDream-ai/HiDream-O1-Image-Dev-2604) with its [prompt refiner](https://huggingface.co/HiDream-ai/Prompt-Refine), tailored for text-to-image generation task.
12
  - πŸ› οΈ **May 13, 2026:** Inference & pipeline updates β€” accelerated IP inference; the IP pipeline now supports **layout** and **skeleton** conditioning; updated the Dev editing scheduler. For editing tasks we recommend using the **full** model. PyTorch 2.9.x is not recommended due to the [issue](https://github.com/QwenLM/Qwen3-VL/issues/1811).
13
  - πŸ€— **May 10, 2026:** Try **HiDream-O1-Image** online on Hugging Face Spaces β€” [πŸ€— HiDream-O1-Image](https://huggingface.co/spaces/HiDream-ai/HiDream-O1-Image) and [πŸ€— HiDream-O1-Image-Dev](https://huggingface.co/spaces/HiDream-ai/HiDream-O1-Image-Dev).
14
- - πŸ“• **May 10, 2026:** Our **technical report** is now available β€” [πŸ“‘ HiDream-O1-Image.pdf](https://github.com/HiDream-ai/HiDream-O1-Image/blob/main/assets/HiDream-O1-Image.pdf).
15
  - πŸš€ **May 8, 2026:** We've open-sourced **HiDream-O1-Image (8B)**, including both the undistilled and distilled Dev variants, together with the Reasoning-Driven Prompt Agent.
16
 
17
  > **HiDream-O1-Image-Dev-2604 debuts at #8 in the Artificial Analysis Text to Image Arena, which is positioned to be the new leading open weights Text to Image model.**
@@ -364,4 +364,15 @@ The sidebar contains a Prompt Agent panel that calls the same Reasoning-Driven P
364
  When the server is launched with `--model_type dev`, the **Edit** tab exposes a *Scheduler* dropdown with two options: `flow_match` (default) and `flash`. The selector is hidden for the `full` model and for the Text β†’ Image / Subject tabs, where the scheduler is fixed.
365
 
366
  ## License
367
- The code in this repository and the HiDream-O1-Image models are licensed under MIT License.
 
 
 
 
 
 
 
 
 
 
 
 
11
  - πŸš€ **May 14, 2026:** We open-sourced [**HiDream-O1-Image-Dev-2604**](https://huggingface.co/HiDream-ai/HiDream-O1-Image-Dev-2604) with its [prompt refiner](https://huggingface.co/HiDream-ai/Prompt-Refine), tailored for text-to-image generation task.
12
  - πŸ› οΈ **May 13, 2026:** Inference & pipeline updates β€” accelerated IP inference; the IP pipeline now supports **layout** and **skeleton** conditioning; updated the Dev editing scheduler. For editing tasks we recommend using the **full** model. PyTorch 2.9.x is not recommended due to the [issue](https://github.com/QwenLM/Qwen3-VL/issues/1811).
13
  - πŸ€— **May 10, 2026:** Try **HiDream-O1-Image** online on Hugging Face Spaces β€” [πŸ€— HiDream-O1-Image](https://huggingface.co/spaces/HiDream-ai/HiDream-O1-Image) and [πŸ€— HiDream-O1-Image-Dev](https://huggingface.co/spaces/HiDream-ai/HiDream-O1-Image-Dev).
14
+ - πŸ“• **May 10, 2026:** Our **technical report** is now available β€” [πŸ“‘ HiDream-O1-Image.pdf](https://arxiv.org/pdf/2605.11061v1).
15
  - πŸš€ **May 8, 2026:** We've open-sourced **HiDream-O1-Image (8B)**, including both the undistilled and distilled Dev variants, together with the Reasoning-Driven Prompt Agent.
16
 
17
  > **HiDream-O1-Image-Dev-2604 debuts at #8 in the Artificial Analysis Text to Image Arena, which is positioned to be the new leading open weights Text to Image model.**
 
364
  When the server is launched with `--model_type dev`, the **Edit** tab exposes a *Scheduler* dropdown with two options: `flow_match` (default) and `flash`. The selector is hidden for the `full` model and for the Text β†’ Image / Subject tabs, where the scheduler is fixed.
365
 
366
  ## License
367
+ The code in this repository and the HiDream-O1-Image models are licensed under MIT License.
368
+
369
+ ## Citation
370
+
371
+ ```bibtex
372
+ @article{hidreamolimage,
373
+ title={HiDream-O1-Image: A Natively Unified Image Generative Foundation Model with Pixel-level Unified Transformer},
374
+ author={Cai, Qi and Chen, Jingwen and Gao, Chengmin and Gong, Zijian and Li, Yehao and Mei, Tao and Pan, Yingwei and Peng, Yi and Qiu, Zhaofan and Yao, Ting and Yu, Kai and Zhang, Yiheng and others},
375
+ journal={arXiv preprint arXiv:2605.11061},
376
+ year={2026}
377
+ }
378
+ ```