| --- |
| license: apache-2.0 |
| pipeline_tag: text-to-image |
| tags: |
| - image-generation |
| - autoregressive |
| - flashar |
| - emu3.5 |
| - emu3.5-image |
| --- |
| |
| # Emu3.5-Image-FlashAR |
|
|
| This repository hosts the FlashAR checkpoint for Emu3.5-Image. |
|
|
| FlashAR is introduced in **"FlashAR: Efficient Post-Training Acceleration for Autoregressive Image Generation"**. It accelerates a pretrained raster-scan autoregressive image generator by adding a vertical prediction branch and a learnable fusion gate. Decoding proceeds by anti-diagonal steps, reducing the serial image-token decoding length from `H * W` to `H + W - 1`. |
|
|
| - Paper: [arXiv:2605.09430](https://arxiv.org/abs/2605.09430) |
| - Project page: [FlashAR](https://lxazjk.github.io/FlashAR/) |
| - Code: [Emu3.5-NAR](https://github.com/lxazjk/Emu3.5-NAR) |
|
|
| ## Checkpoint |
|
|
| This checkpoint corresponds to: |
|
|
| - Base model family: Emu3.5-Image |
| - Checkpoint name: `Emu3.5-Image-FlashAR` |
| - Training step: `74000` |
| - Default visual-token grid used by the release scripts: `32 x 32` |
| - Default CFG setting used for the released checkpoint metadata: `5.0` |
|
|
| The weight shards are stored at the repository root: |
|
|
| ```text |
| model-00001-of-00016.safetensors |
| ... |
| model-00016-of-00016.safetensors |
| model.safetensors.index.json |
| config.json |
| checkpoint_meta.json |
| configuration.json |
| ``` |
|
|
| ## Usage |
|
|
| Use this repository as the FlashAR checkpoint path together with: |
|
|
| - the base Emu3.5-Image model; |
| - the Emu3.5 vision tokenizer; |
| - the FlashAR code from the project repository. |
|
|
| Example layout: |
|
|
| ```text |
| weights/Emu3.5-Image/ |
| weights/Emu3.5-VisionTokenizer/ |
| checkpoints/Emu3.5-Image-FlashAR/ |
| ``` |
|
|
| Example generation command from the code repository: |
|
|
| ```bash |
| MODEL_PATH=./weights/Emu3.5-Image \ |
| TOKENIZER_PATH=./src/tokenizer_emu3_ibq \ |
| VQ_PATH=./weights/Emu3.5-VisionTokenizer \ |
| CKPT_PATH=./checkpoints/Emu3.5-Image-FlashAR \ |
| PROMPT="a red car parked next to a blue mailbox" \ |
| CFG_SCALE=5.0 \ |
| OUT_PATH=./outputs/sample.png \ |
| bash generate.sh |
| ``` |
|
|
| ## Citation |
|
|
| ```bibtex |
| @article{zhou2026flashar, |
| title={FlashAR: Efficient Post-Training Acceleration for Autoregressive Image Generation}, |
| author={Zhou, Junkang and He, Yefei and Chen, Feng and Wang, Weijie and Zhuang, Bohan}, |
| journal={arXiv preprint arXiv:2605.09430}, |
| year={2026} |
| } |
| ``` |
|
|