# FlashAR-LlamaGen-XL

Best GPT-XL checkpoint for FlashAR / LlamaGen NAR ImageNet-256 evaluation.

## Files

- `FlashAR-LlamaGen-XL.pt`: clean inference checkpoint.
- `FlashAR-LlamaGen-XL.json`: metric and provenance sidecar.

## Metrics

- Dataset/eval: ImageNet-256
- Step: 107,500
- FID: 3.054045162945613
- sFID: 6.683951400631031
- Inception Score: 259.35772705078125
- Precision: 0.80102
- Recall: 0.5758

## Checkpoint format

The `.pt` file contains:

- `model`: model state dict only
- `args`: original training args
- `steps`: training step
- `metrics`: best eval metrics

It does not contain optimizer state, scheduler state, or KV-cache buffers. Verification found zero `kv_cache`, `k_cache`, or `v_cache` keys in this uploaded checkpoint.
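
The layout described above can be checked locally with a short script. This is a minimal sketch, not the verification procedure actually used for this upload: the helper names (`load_checkpoint`, `missing_top_level_keys`, `find_cache_keys`) are hypothetical, and passing `weights_only=False` to `torch.load` is an assumption, made because `args` may be stored as a pickled non-tensor object. Only use that setting on files you trust.

```python
from typing import Any, Dict, List

# Substrings that would indicate leftover KV-cache buffers in the state dict.
CACHE_MARKERS = ("kv_cache", "k_cache", "v_cache")
# Top-level keys this README says the checkpoint contains.
EXPECTED_KEYS = ("model", "args", "steps", "metrics")


def load_checkpoint(path: str) -> Dict[str, Any]:
    """Load the checkpoint onto CPU (hypothetical helper)."""
    # Imported lazily so the pure-Python checks below work without PyTorch.
    import torch

    # Assumption: `args` may not be a plain tensor/dict, so full unpickling
    # is enabled. Do this only for checkpoints from a trusted source.
    return torch.load(path, map_location="cpu", weights_only=False)


def missing_top_level_keys(ckpt: Dict[str, Any]) -> List[str]:
    """Return the documented top-level keys that are absent from `ckpt`."""
    return [key for key in EXPECTED_KEYS if key not in ckpt]


def find_cache_keys(state_dict: Dict[str, Any]) -> List[str]:
    """Return state-dict entries whose names look like KV-cache buffers."""
    return [
        name
        for name in state_dict
        if any(marker in name for marker in CACHE_MARKERS)
    ]


# Example usage (requires PyTorch and the downloaded file):
#   ckpt = load_checkpoint("FlashAR-LlamaGen-XL.pt")
#   print("missing keys:", missing_top_level_keys(ckpt))
#   print("cache keys:", find_cache_keys(ckpt["model"]))
```

If the checkpoint matches the description in this section, both calls in the usage comment should report empty lists.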