FlashAR-LlamaGen-B

Best GPT-B checkpoint for FlashAR / LlamaGen NAR ImageNet-256 evaluation.

Files

FlashAR-LlamaGen-B.pt: clean inference checkpoint.
FlashAR-LlamaGen-B.json: metric and provenance sidecar.

Metrics

Dataset/eval: ImageNet-256
Step: 75,000
FID: 4.680193238979371
sFID: 6.680051826699128
Inception Score: 208.30068969726562
Precision: 0.83106
Recall: 0.4761

Checkpoint format

The .pt file contains:

model: model state dict only
args: original training args
steps: training step
metrics: best eval metrics

It does not contain optimizer state, scheduler state, or KV-cache buffers. The source training checkpoint contained 26 kv_cache buffers; they were explicitly removed before upload.