Collections

Discover the best community collections!

Collections including paper arxiv:2310.03744
LLaVa-NeXT
LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets.
Top Vision-Language Papers πŸ–ΌοΈπŸ’¬πŸ“
A curated list of papers on vision-language models, with the most influential ones at the top.
multilingual vision models
Some papers I read for understanding vision models and also adding multilingual capabilities to them
Multimodal Papers
Collection by
Apr 22, 2024
vision
Collection by
13 days ago
MM-LLMs
Collection by
Sep 9, 2024
Vision Language Models Papers πŸ–ΌοΈπŸ’¬πŸ“
Papers about vision-language models, most important ones are on top of the list.
LLaVa-NeXT
LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets.
vision
Collection by
13 days ago
Top Vision-Language Papers πŸ–ΌοΈπŸ’¬πŸ“
A curated list of papers on vision-language models, with the most influential ones at the top.
MM-LLMs
Collection by
Sep 9, 2024
multilingual vision models
Some papers I read for understanding vision models and also adding multilingual capabilities to them
Multimodal Papers
Collection by
Apr 22, 2024
Vision Language Models Papers πŸ–ΌοΈπŸ’¬πŸ“
Papers about vision-language models, most important ones are on top of the list.