LlamaGen-FlashAR / B /README.md
lxazjk's picture
Upload LlamaGen FlashAR checkpoints
0062bb1 verified

FlashAR-LlamaGen-B

Best GPT-B checkpoint for FlashAR / LlamaGen NAR ImageNet-256 evaluation.

Files

  • FlashAR-LlamaGen-B.pt: clean inference checkpoint.
  • FlashAR-LlamaGen-B.json: metric and provenance sidecar.

Metrics

  • Dataset/eval: ImageNet-256
  • Step: 75,000
  • FID: 4.680193238979371
  • sFID: 6.680051826699128
  • Inception Score: 208.30068969726562
  • Precision: 0.83106
  • Recall: 0.4761

Checkpoint format

The .pt file contains:

  • model: model state dict only
  • args: original training args
  • steps: training step
  • metrics: best eval metrics

It does not contain optimizer state, scheduler state, or KV-cache buffers. The source training checkpoint contained 26 kv_cache buffers; they were explicitly removed before upload.