HiDream-ai/HiDream-O1-Image

Image-Text-to-Image

image-text-to-text


cai-qi committed 11 days ago

Commit ef3abc9 · verified · 1 Parent(s): 4114874

Update README.md

Files changed (1)

README.md +8 -0


@@ -8,6 +8,14 @@ library_name: transformers
 HiDream-O1-Image is a natively unified image generative foundation model built on a Pixel-level Unified Transformer (UiT) without external VAEs or disjoint text encoders, which natively encodes raw pixels, text, and task-specific conditions in a single shared token space — supporting text-to-image, image editing, and subject-driven personalization at up to 2,048 × 2,048.
+> **HiDream-O1-Image (codename: Peanut) debuts at #8 in the Artificial Analysis Text to Image Arena (2026-05-05), where it is positioned to be the new leading open-weights text-to-image model.**
+<p align="center">
+  <img src="assets/leaderboard.png" alt="Artificial Analysis Text to Image Arena" width="100%"/>
+  <br><sub><b>Artificial Analysis Text to Image Arena</b></sub>
+</p>
 <p align="center">
   <img src="assets/general.webp" alt="General text-to-image generation" width="100%"/>
   <br><sub><b>General text-to-image generation</b> at up to 2,048 × 2,048.</sub>