| # FlashAR-LlamaGen-B | |
| Best GPT-B checkpoint for FlashAR / LlamaGen NAR ImageNet-256 evaluation. | |
| ## Files | |
| - `FlashAR-LlamaGen-B.pt`: clean inference checkpoint. | |
| - `FlashAR-LlamaGen-B.json`: metric and provenance sidecar. | |
| ## Metrics | |
| - Dataset/eval: ImageNet-256 | |
| - Step: 75,000 | |
| - FID: 4.680193238979371 | |
| - sFID: 6.680051826699128 | |
| - Inception Score: 208.30068969726562 | |
| - Precision: 0.83106 | |
| - Recall: 0.4761 | |
| ## Checkpoint format | |
| The `.pt` file contains: | |
| - `model`: model state dict only | |
| - `args`: original training args | |
| - `steps`: training step | |
| - `metrics`: best eval metrics | |
| It does not contain optimizer state, scheduler state, or KV-cache buffers. The source training checkpoint contained 26 `kv_cache` buffers; they were explicitly removed before upload. | |