# FlashAR-LlamaGen-B
Best GPT-B checkpoint for FlashAR / LlamaGen NAR ImageNet-256 evaluation.
## Files
- `FlashAR-LlamaGen-B.pt`: clean inference checkpoint.
- `FlashAR-LlamaGen-B.json`: metric and provenance sidecar.
## Metrics
- Dataset/eval: ImageNet-256
- Step: 75,000
- FID: 4.680193238979371
- sFID: 6.680051826699128
- Inception Score: 208.30068969726562
- Precision: 0.83106
- Recall: 0.4761
## Checkpoint format
The `.pt` file contains:
- `model`: model state dict only
- `args`: original training args
- `steps`: training step
- `metrics`: best eval metrics
It does not contain optimizer state, scheduler state, or KV-cache buffers. The source training checkpoint contained 26 `kv_cache` buffers; they were explicitly removed before upload.
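
As a minimal sketch of how the layout above could be verified after download, the helpers below check the four top-level keys and confirm that no KV-cache buffers remain in the state dict. The function names (`strip_kv_cache`, `check_checkpoint`, `load_checkpoint`) are hypothetical and not part of this repository. Matching buffers by the substring `kv_cache` in the key name is an assumption, as is the need for `weights_only=False` (it assumes `args` is stored as a pickled Python object rather than plain tensors).

```python
from typing import Any, Dict, Mapping

# Top-level keys listed in the "Checkpoint format" section.
EXPECTED_KEYS = {"model", "args", "steps", "metrics"}


def strip_kv_cache(state_dict: Mapping[str, Any]) -> Dict[str, Any]:
    """Return a copy of state_dict without entries whose key contains 'kv_cache'.

    Assumption: KV-cache buffers can be recognised by that substring.
    """
    return {k: v for k, v in state_dict.items() if "kv_cache" not in k}


def check_checkpoint(ckpt: Mapping[str, Any]) -> Mapping[str, Any]:
    """Validate the documented layout and return the model state dict."""
    missing = EXPECTED_KEYS - set(ckpt.keys())
    if missing:
        raise KeyError(f"checkpoint is missing keys: {sorted(missing)}")
    leftover = [k for k in ckpt["model"] if "kv_cache" in k]
    if leftover:
        raise ValueError(f"unexpected kv_cache buffers: {leftover[:3]}")
    return ckpt["model"]


def load_checkpoint(path: str) -> Mapping[str, Any]:
    """Load the .pt file on CPU and return its validated state dict."""
    import torch  # imported lazily so the helpers above work without PyTorch

    # Assumption: `args` may be a pickled object, which needs weights_only=False.
    # Unpickling can execute code, so only do this for files you trust.
    ckpt = torch.load(path, map_location="cpu", weights_only=False)
    return check_checkpoint(ckpt)
```

With these helpers, `state_dict = load_checkpoint("FlashAR-LlamaGen-B.pt")` would return the weights ready for `load_state_dict` on a model built from the stored `args`. The model constructor itself is not described in this card, so it is left out here.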