Vision
updated
liuhaotian/llava-v1.6-34b
Image-Text-to-Text
• 35B • Updated • 32.9k
• 362
deepseek-ai/deepseek-vl-7b-base
7B • Updated • 88
• 65
deepseek-ai/deepseek-vl-7b-chat
Image-Text-to-Text
• 7B • Updated • 4.19k
• 270
HuggingFaceM4/idefics2-8b
Image-Text-to-Text
• 8B • Updated • 142k
• 621
HuggingFaceM4/idefics2-8b-chatty
Image-Text-to-Text
• 8B • Updated • 75
• 95
HuggingFaceM4/idefics2-8b-base
Image-Text-to-Text
• 8B • Updated • 1.21k
• 28
google/paligemma-3b-pt-896
Image-Text-to-Text
• 3B • Updated • 588
• 124
microsoft/Phi-3-vision-128k-instruct
Text Generation
• Updated • 99.6k
• 970
Image-Text-to-Text
• 7B • Updated • 85.5k
• 200
microsoft/Phi-3.5-vision-instruct
Image-Text-to-Text
• Updated • 1.49M
• 730
meta-llama/Llama-3.2-11B-Vision
Image-Text-to-Text
• 11B • Updated • 13.3k
• 586
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text
• 11B • Updated • 207k
• 1.58k
meta-llama/Llama-3.2-90B-Vision
Image-Text-to-Text
• 89B • Updated • 2.59k
• 134
meta-llama/Llama-3.2-90B-Vision-Instruct
Image-Text-to-Text
• 89B • Updated • 12.2k
• 355
meta-llama/Llama-Guard-3-11B-Vision
Image-Text-to-Text
• 11B • Updated • 2.02k
• 71
Image-Text-to-Text
• 73B • Updated • 5.4k
• 298
Image-Text-to-Text
• 8B • Updated • 19k
• 565
Image-Text-to-Text
• 8B • Updated • 1.29k
• 163
Image-Text-to-Text
• Updated • 1.04k
• 157
Text-to-Video
• Updated • 5.3k
• • 1.32k
Image-Text-to-Text
• Updated • 265
• 1.71k
Image-to-Video
• Updated • 559k
• • 2.16k