FlashAR-LlamaGen-XL
Best GPT-XL checkpoint for FlashAR / LlamaGen NAR ImageNet-256 evaluation.
Files
- FlashAR-LlamaGen-XL.pt: clean inference checkpoint.
- FlashAR-LlamaGen-XL.json: metric and provenance sidecar.
Metrics
- Dataset/eval: ImageNet-256
- Step: 107,500
- FID: 3.054045162945613
- sFID: 6.683951400631031
- Inception Score: 259.35772705078125
- Precision: 0.80102
- Recall: 0.5758
Checkpoint format
The .pt file contains:
- model: model state dict only
- args: original training args
- steps: training step
- metrics: best eval metrics
It does not contain optimizer state, scheduler state, or KV-cache buffers. Verification found zero kv_cache, k_cache, or v_cache keys in this uploaded checkpoint.
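The layout and the cache-key check described above can be reproduced with a short script. This is a minimal sketch, assuming the .pt file loads with torch.load into a plain dict holding the four listed keys. The helper names find_cache_keys, summarize, and load_checkpoint are hypothetical and are not part of the FlashAR or LlamaGen code. Passing weights_only=False is also an assumption, made because args may be a pickled non-tensor object; only use it on a file you trust.

```python
from typing import Iterable, List

# Substrings named in the verification note above.
CACHE_MARKERS = ("kv_cache", "k_cache", "v_cache")


def find_cache_keys(keys: Iterable[str]) -> List[str]:
    """Return the keys that contain any KV-cache marker."""
    return [k for k in keys if any(m in k for m in CACHE_MARKERS)]


def summarize(ckpt: dict) -> dict:
    """Report the top-level keys and any cache keys inside ckpt['model']."""
    return {
        "top_level_keys": sorted(ckpt),
        "cache_keys": find_cache_keys(ckpt["model"]),
    }


def load_checkpoint(path: str) -> dict:
    """Load the .pt file on CPU (hypothetical helper; requires PyTorch)."""
    import torch  # imported lazily so the key check runs without PyTorch

    # Assumption: `args` may not be a plain tensor container, hence
    # weights_only=False. Only load checkpoints from a trusted source.
    return torch.load(path, map_location="cpu", weights_only=False)
```

To use it, call summarize(load_checkpoint("FlashAR-LlamaGen-XL.pt")). If the file matches the description above, the cache_keys list should come back empty.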