File size: 761 Bytes
0062bb1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
# FlashAR-LlamaGen-B

Best GPT-B checkpoint for FlashAR / LlamaGen NAR ImageNet-256 evaluation.

## Files

- `FlashAR-LlamaGen-B.pt`: clean inference checkpoint.
- `FlashAR-LlamaGen-B.json`: metric and provenance sidecar.

## Metrics

- Dataset/eval: ImageNet-256
- Step: 75,000
- FID: 4.680193238979371
- sFID: 6.680051826699128
- Inception Score: 208.30068969726562
- Precision: 0.83106
- Recall: 0.4761

## Checkpoint format

The `.pt` file contains:

- `model`: model state dict only
- `args`: original training args
- `steps`: training step
- `metrics`: best eval metrics

It does not contain optimizer state, scheduler state, or KV-cache buffers. The source training checkpoint contained 26 `kv_cache` buffers; they were explicitly removed before upload.