FrameSkip: Learning from Fewer but More Informative Frames in VLA Training
Paper | Code | Collection
FrameSkip is a training-time frame selection framework for Vision-Language-Action (VLA) models. Instead of treating every frame in a dense robot demonstration trajectory as equally useful supervision, FrameSkip scores trajectory frames with lightweight cues and trains primarily from fewer but more informative frames.
FrameSkip is designed as a data-layer intervention: it changes which frames are exposed during training while leaving the VLA architecture, action head, training objective, and inference procedure unchanged.
Highlights
- Frame-level supervision allocation: Addresses the temporal supervision imbalance where low-change segments often dominate training trajectories.
- Architecture-agnostic: Operates entirely in the dataloader, requiring no changes to the model architecture or inference.
- Importance-guided retention: Scores frames using action variation, visual-action coherence, task-progress priors, and gripper-transition preservation.
- Improved Efficiency: Achieves significantly higher success rates across benchmarks (RoboCasa-GR1, SimplerEnv, and LIBERO) while using as little as 20% of unique frames.
Usage
FrameSkip is built on the starVLA training and evaluation stack. The released checkpoints follow the standard starVLA checkpoint format and can be loaded in the same way as starVLA VLA policies.
For simulation evaluation, please refer to the model loading and evaluation workflow of the QwenGR00T architecture in starVLA, and replace the checkpoint path with the downloaded FrameSkip checkpoint.
Citation
If you find FrameSkip useful, please cite the paper:
@article{FrameSkip,
title={FrameSkip: Learning from Fewer but More Informative Frames in VLA Training},
author={Bin Yu and Shijie Lian and Xiaopeng Lin and Zhaolong Shen and Yuliang Wei and Changti Wu and Hang Yuan and Haishan Liu and Bailing Wang and Cong Huang and Kai Chen},
journal={arXiv preprint arXiv:2605.13757},
year={2026}
}
- Downloads last month
- -