# X-VLA v5 Checkpoints – BEHAVIOR-1K
## Changes from v4
### 1. Attention-Pooled Classifiers

- Replaced mean-pool + single-layer `classifier_proj` with per-classifier `AttentionPool` + 2-layer MLP projections (skill, task, object)
- 1% of the gradient flows back to the VLM (instead of a full `.detach()`), so the backbone learns discriminative features
- Deeper projection: `Linear(D→2D) → GELU → LayerNorm → Linear(2D→D) → GELU`
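The head described above can be sketched in PyTorch. This is a minimal illustration, not the actual X-VLA code: the class names, the choice of 8 attention heads, and the scaled-residual way of leaking 1% of the gradient are all assumptions.

```python
import torch
import torch.nn as nn

class AttentionPool(nn.Module):
    """Pool a token sequence with a learned query (replaces mean-pooling)."""
    def __init__(self, dim: int, num_heads: int = 8):
        super().__init__()
        self.query = nn.Parameter(torch.zeros(1, 1, dim))
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:  # x: (B, T, D)
        q = self.query.expand(x.size(0), -1, -1)          # (B, 1, D)
        pooled, _ = self.attn(q, x, x)                    # attend over tokens
        return pooled.squeeze(1)                          # (B, D)

class ClassifierHead(nn.Module):
    """AttentionPool + Linear(D->2D) -> GELU -> LayerNorm -> Linear(2D->D) -> GELU."""
    def __init__(self, dim: int):
        super().__init__()
        self.pool = AttentionPool(dim)
        self.proj = nn.Sequential(
            nn.Linear(dim, 2 * dim), nn.GELU(),
            nn.LayerNorm(2 * dim),
            nn.Linear(2 * dim, dim), nn.GELU(),
        )

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        # Leak 1% of the gradient into the backbone instead of a full detach:
        # the value is unchanged, but d(out)/d(feats) is scaled by 0.01.
        feats = 0.01 * feats + 0.99 * feats.detach()
        return self.proj(self.pool(feats))

head = ClassifierHead(64)
out = head(torch.randn(2, 10, 64))   # 2 sequences of 10 tokens, D=64
print(out.shape)                     # (2, 64)
```

One such head would be instantiated per classifier (skill, task, object), each with its own pooling query and MLP.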
### 2. Concatenated Soft Prompts

- Replaced element-wise addition (`task + skill + object`) with concatenation
- Token budget split: task=8, skill=16, object=8 (total=32, same as before)
- Prevents destructive interference between prompt sources
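The token-budget split above amounts to the following, sketched here with random tensors and an assumed embedding width `D` (the real prompts are learned parameters):

```python
import torch

D = 64  # assumed prompt embedding width, for illustration only
task_prompt  = torch.randn(8,  D)   # task budget  = 8 tokens
skill_prompt = torch.randn(16, D)   # skill budget = 16 tokens
obj_prompt   = torch.randn(8,  D)   # object budget = 8 tokens

# v4 summed the three sources into a single shared set of slots; v5 instead
# concatenates them so each source keeps its own slots (8 + 16 + 8 = 32).
soft_prompt = torch.cat([task_prompt, skill_prompt, obj_prompt], dim=0)
print(soft_prompt.shape)  # (32, 64)
```

Concatenation keeps the total token budget at 32 while guaranteeing that no prompt source can overwrite another's representation.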
## Training Config
- Base model: `X-VLA-Pt`
- Learning rate: 2e-5 (cosine decay, 0.1 min ratio)
- Soft prompt LR: 2e-6 (0.1x multiplier)
- Batch size: 16 × 4 GPUs
- Freeze steps: 1000, warmup: 2000
- Action mode: auto (23D)
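The learning-rate numbers above imply a schedule roughly like the following sketch. The `lr_at` helper, the linear warmup shape, and the 30k total-step assumption are hypothetical; only the base LR, warmup length, and 0.1 min ratio come from the config.

```python
import math

def lr_at(step: int, base_lr: float = 2e-5, warmup: int = 2000,
          total: int = 30000, min_ratio: float = 0.1) -> float:
    """Linear warmup, then cosine decay down to min_ratio * base_lr."""
    if step < warmup:
        return base_lr * step / warmup
    t = (step - warmup) / max(1, total - warmup)   # progress in [0, 1]
    cosine = 0.5 * (1.0 + math.cos(math.pi * t))   # 1 -> 0
    return base_lr * (min_ratio + (1.0 - min_ratio) * cosine)

peak = lr_at(2000)        # end of warmup: 2e-5
floor = lr_at(30000)      # end of training: 0.1 * 2e-5 = 2e-6
prompt_lr = 0.1 * peak    # soft prompts run at a 0.1x multiplier: 2e-6
```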
## Checkpoints
| Folder | Task | Steps | Notes |
|---|---|---|---|
| `task0-20k` | 0 (turning_on_radio) | 20,000 | Single-task |
| `task40-30k` | 40 (make_microwave_popcorn) | 30,000 | Single-task |