lxazjk/LlamaGen-FlashAR

# FlashAR-LlamaGen-XL

Best GPT-XL checkpoint for FlashAR / LlamaGen NAR ImageNet-256 evaluation.

## Files

- `FlashAR-LlamaGen-XL.pt`: clean inference checkpoint.
- `FlashAR-LlamaGen-XL.json`: metric and provenance sidecar.
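
Both files can be fetched programmatically. The following is a minimal sketch, not an official loader. It assumes the `huggingface_hub` package is installed, that the repository id is `lxazjk/LlamaGen-FlashAR`, and that both files live in an `XL/` subfolder alongside this README. The repository id and subfolder are inferred from the page layout rather than stated in the README, and the helper names `remote_path` and `download_files` are hypothetical.

```python
from typing import Dict

REPO_ID = "lxazjk/LlamaGen-FlashAR"  # assumption: inferred from the page, not the README
SUBFOLDER = "XL"                     # assumption: files sit next to XL/README.md
FILENAMES = ("FlashAR-LlamaGen-XL.pt", "FlashAR-LlamaGen-XL.json")


def remote_path(filename: str, subfolder: str = SUBFOLDER) -> str:
    """Build the in-repository path for a file (hypothetical helper)."""
    return f"{subfolder}/{filename}" if subfolder else filename


def download_files(repo_id: str = REPO_ID) -> Dict[str, str]:
    """Download both files and return {filename: local cache path}."""
    # Imported lazily so the path helper above works without the package.
    from huggingface_hub import hf_hub_download

    return {
        name: hf_hub_download(repo_id=repo_id, filename=remote_path(name))
        for name in FILENAMES
    }
```

Calling `download_files()` requires network access. If the files turn out to be stored at the repository root, pass `subfolder=""` to `remote_path` instead.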

## Metrics

- Dataset/eval: ImageNet-256
- Step: 107,500
- FID: 3.054045162945613
- sFID: 6.683951400631031
- Inception Score: 259.35772705078125
- Precision: 0.80102
- Recall: 0.5758

## Checkpoint format

The `.pt` file contains:

- `model`: model state dict only
- `args`: original training args
- `steps`: training step
- `metrics`: best eval metrics

It does not contain optimizer state, scheduler state, or KV-cache buffers. A key scan of the uploaded checkpoint found no `kv_cache`, `k_cache`, or `v_cache` keys.
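
The layout described above can be checked after loading. This is a rough sketch under stated assumptions, not the verification script that was actually used: it assumes PyTorch is installed, that the file is a standard `torch.save` pickle, and that `args` is a non-tensor Python object (which is why `weights_only=False` is passed; only do this for files you trust). The helper names are hypothetical.

```python
from typing import Any, Dict, Iterable, List

EXPECTED_KEYS = ("model", "args", "steps", "metrics")
CACHE_MARKERS = ("kv_cache", "k_cache", "v_cache")


def load_checkpoint(path: str) -> Dict[str, Any]:
    """Load the checkpoint onto the CPU (assumes a torch.save pickle)."""
    import torch  # imported lazily so the checks below run without PyTorch

    # Assumption: `args` is a pickled non-tensor object, so the restricted
    # weights-only unpickler may reject it. Only load trusted files this way.
    return torch.load(path, map_location="cpu", weights_only=False)


def missing_top_level_keys(ckpt: Dict[str, Any]) -> List[str]:
    """Return the documented top-level keys that are absent."""
    return [key for key in EXPECTED_KEYS if key not in ckpt]


def find_cache_keys(state_dict_keys: Iterable[str]) -> List[str]:
    """Return state-dict keys whose names mention a KV-cache buffer."""
    return [
        key for key in state_dict_keys
        if any(marker in key for marker in CACHE_MARKERS)
    ]


# Example usage (requires the downloaded file):
#   ckpt = load_checkpoint("FlashAR-LlamaGen-XL.pt")
#   print(missing_top_level_keys(ckpt))         # empty if the layout matches
#   print(find_cache_keys(ckpt["model"].keys()))  # empty if no cache buffers
```

Matching on substrings rather than exact names keeps the check independent of how the model nests its attention modules, at the cost of flagging any unrelated parameter whose name happens to contain one of the markers.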