File size: 2,237 Bytes
c214bd2 815dea4 c214bd2 815dea4 c214bd2 815dea4 c214bd2 bb7157e c214bd2 bb7157e c214bd2 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 | ---
license: apache-2.0
pipeline_tag: text-to-image
tags:
- image-generation
- autoregressive
- flashar
- emu3.5
- emu3.5-image
---
# Emu3.5-Image-FlashAR
This repository hosts the FlashAR checkpoint for Emu3.5-Image.
FlashAR is introduced in **"FlashAR: Efficient Post-Training Acceleration for Autoregressive Image Generation"**. It accelerates a pretrained raster-scan autoregressive image generator by adding a vertical prediction branch and a learnable fusion gate. Decoding proceeds by anti-diagonal steps, reducing the serial image-token decoding length from `H * W` to `H + W - 1`.
- Paper: [arXiv:2605.09430](https://arxiv.org/abs/2605.09430)
- Project page: [FlashAR](https://lxazjk.github.io/FlashAR/)
- Code: [Emu3.5-NAR](https://github.com/lxazjk/Emu3.5-NAR)
## Checkpoint
This checkpoint corresponds to:
- Base model family: Emu3.5-Image
- Checkpoint name: `Emu3.5-Image-FlashAR`
- Training step: `74000`
- Default visual-token grid used by the release scripts: `32 x 32`
- Default CFG setting used for the released checkpoint metadata: `5.0`
The weight shards are stored at the repository root:
```text
model-00001-of-00016.safetensors
...
model-00016-of-00016.safetensors
model.safetensors.index.json
config.json
checkpoint_meta.json
configuration.json
```
## Usage
Use this repository as the FlashAR checkpoint path together with:
- the base Emu3.5-Image model;
- the Emu3.5 vision tokenizer;
- the FlashAR code from the project repository.
Example layout:
```text
weights/Emu3.5-Image/
weights/Emu3.5-VisionTokenizer/
checkpoints/Emu3.5-Image-FlashAR/
```
Example generation command from the code repository:
```bash
MODEL_PATH=./weights/Emu3.5-Image \
TOKENIZER_PATH=./src/tokenizer_emu3_ibq \
VQ_PATH=./weights/Emu3.5-VisionTokenizer \
CKPT_PATH=./checkpoints/Emu3.5-Image-FlashAR \
PROMPT="a red car parked next to a blue mailbox" \
CFG_SCALE=5.0 \
OUT_PATH=./outputs/sample.png \
bash generate.sh
```
## Citation
```bibtex
@article{zhou2026flashar,
title={FlashAR: Efficient Post-Training Acceleration for Autoregressive Image Generation},
author={Zhou, Junkang and He, Yefei and Chen, Feng and Wang, Weijie and Zhuang, Bohan},
journal={arXiv preprint arXiv:2605.09430},
year={2026}
}
```
|