Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts
Paper • 2111.08276 • Published
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
This checkpoint is copied from https://drive.google.com/drive/folders/1VotCNmdevvtMuJmdxPfg3MOZXJRnV96D
This repo is created for convenient huggingface_hub loading. The original LICENSE is included here.
Citation:
@article{xvlm,
title={Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts},
author={Zeng, Yan and Zhang, Xinsong and Li, Hang},
journal={arXiv preprint arXiv:2111.08276},
year={2021}
}