File size: 2,237 Bytes
c214bd2
 
 
 
 
 
 
 
 
 
815dea4
c214bd2
815dea4
c214bd2
815dea4
c214bd2
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
bb7157e
c214bd2
 
 
bb7157e
c214bd2
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
---
license: apache-2.0
pipeline_tag: text-to-image
tags:
- image-generation
- autoregressive
- flashar
- emu3.5
- emu3.5-image
---

# Emu3.5-Image-FlashAR

This repository hosts the FlashAR checkpoint for Emu3.5-Image.

FlashAR is introduced in **"FlashAR: Efficient Post-Training Acceleration for Autoregressive Image Generation"**. It accelerates a pretrained raster-scan autoregressive image generator by adding a vertical prediction branch and a learnable fusion gate. Decoding proceeds by anti-diagonal steps, reducing the serial image-token decoding length from `H * W` to `H + W - 1`.

- Paper: [arXiv:2605.09430](https://arxiv.org/abs/2605.09430)
- Project page: [FlashAR](https://lxazjk.github.io/FlashAR/)
- Code: [Emu3.5-NAR](https://github.com/lxazjk/Emu3.5-NAR)

## Checkpoint

This checkpoint corresponds to:

- Base model family: Emu3.5-Image
- Checkpoint name: `Emu3.5-Image-FlashAR`
- Training step: `74000`
- Default visual-token grid used by the release scripts: `32 x 32`
- Default CFG setting used for the released checkpoint metadata: `5.0`

The weight shards are stored at the repository root:

```text
model-00001-of-00016.safetensors
...
model-00016-of-00016.safetensors
model.safetensors.index.json
config.json
checkpoint_meta.json
configuration.json
```

## Usage

Use this repository as the FlashAR checkpoint path together with:

- the base Emu3.5-Image model;
- the Emu3.5 vision tokenizer;
- the FlashAR code from the project repository.

Example layout:

```text
weights/Emu3.5-Image/
weights/Emu3.5-VisionTokenizer/
checkpoints/Emu3.5-Image-FlashAR/
```

Example generation command from the code repository:

```bash
MODEL_PATH=./weights/Emu3.5-Image \
TOKENIZER_PATH=./src/tokenizer_emu3_ibq \
VQ_PATH=./weights/Emu3.5-VisionTokenizer \
CKPT_PATH=./checkpoints/Emu3.5-Image-FlashAR \
PROMPT="a red car parked next to a blue mailbox" \
CFG_SCALE=5.0 \
OUT_PATH=./outputs/sample.png \
bash generate.sh
```

## Citation

```bibtex
@article{zhou2026flashar,
  title={FlashAR: Efficient Post-Training Acceleration for Autoregressive Image Generation},
  author={Zhou, Junkang and He, Yefei and Chen, Feng and Wang, Weijie and Zhuang, Bohan},
  journal={arXiv preprint arXiv:2605.09430},
  year={2026}
}
```