LlamaGen-FlashAR / B /README.md
lxazjk's picture
Upload LlamaGen FlashAR checkpoints
0062bb1 verified
# FlashAR-LlamaGen-B
Best GPT-B checkpoint for FlashAR / LlamaGen NAR ImageNet-256 evaluation.
## Files
- `FlashAR-LlamaGen-B.pt`: clean inference checkpoint.
- `FlashAR-LlamaGen-B.json`: metric and provenance sidecar.
## Metrics
- Dataset/eval: ImageNet-256
- Step: 75,000
- FID: 4.680193238979371
- sFID: 6.680051826699128
- Inception Score: 208.30068969726562
- Precision: 0.83106
- Recall: 0.4761
## Checkpoint format
The `.pt` file contains:
- `model`: model state dict only
- `args`: original training args
- `steps`: training step
- `metrics`: best eval metrics
It does not contain optimizer state, scheduler state, or KV-cache buffers. The source training checkpoint contained 26 `kv_cache` buffers; they were explicitly removed before upload.