| library_name: transformers | |
| license: mit | |
| datasets: | |
| - array/SAT | |
| # Model Card for Model ID | |
| Please check https://github.com/arijitray1993/SAT on how to run inference with this model. | |
| If you use the model, please cite: | |
| ``` | |
| @misc{ray2024satspatialaptitudetraining, | |
| title={SAT: Spatial Aptitude Training for Multimodal Language Models}, | |
| author={Arijit Ray and Jiafei Duan and Reuben Tan and Dina Bashkirova and Rose Hendrix and Kiana Ehsani and Aniruddha Kembhavi and Bryan A. Plummer and Ranjay Krishna and Kuo-Hao Zeng and Kate Saenko}, | |
| year={2024}, | |
| eprint={2412.07755}, | |
| archivePrefix={arXiv}, | |
| primaryClass={cs.CV}, | |
| url={https://arxiv.org/abs/2412.07755}, | |
| } | |
| ``` |