LlamaGen-FlashAR / XL /README.md
lxazjk's picture
Upload LlamaGen FlashAR checkpoints
0062bb1 verified
# FlashAR-LlamaGen-XL
Best GPT-XL checkpoint for FlashAR / LlamaGen NAR ImageNet-256 evaluation.
## Files
- `FlashAR-LlamaGen-XL.pt`: clean inference checkpoint.
- `FlashAR-LlamaGen-XL.json`: metric and provenance sidecar.
## Metrics
- Dataset/eval: ImageNet-256
- Step: 107,500
- FID: 3.054045162945613
- sFID: 6.683951400631031
- Inception Score: 259.35772705078125
- Precision: 0.80102
- Recall: 0.5758
## Checkpoint format
The `.pt` file contains:
- `model`: model state dict only
- `args`: original training args
- `steps`: training step
- `metrics`: best eval metrics
It does not contain optimizer state, scheduler state, or KV-cache buffers. Verification found zero `kv_cache`, `k_cache`, or `v_cache` keys in this uploaded checkpoint.