Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
blanchefort
's Collections
Medical
VLA models
Audio
Translate
OCR
OmniModels
Edge models
Video encoders
Judge
Datasets for Embodied
Ru text encoders
Text2Image
VLMs
VLMs
updated
Mar 2
Upvote
-
Qwen/Qwen2-VL-7B-Instruct
Image-Text-to-Text
•
8B
•
Updated
Feb 6, 2025
•
1.3M
•
1.27k
NVEagle/Eagle-X5-13B-Chat
Image-Text-to-Text
•
15B
•
Updated
Sep 16, 2024
•
15
•
28
internlm/internlm-xcomposer2d5-7b
Visual Question Answering
•
Updated
Jul 22, 2024
•
562
•
210
AIRI-Institute/OmniFusion
Updated
Apr 10, 2024
•
59
OpenGVLab/InternVideo2_chat_8B_HD
Video-Text-to-Text
•
8B
•
Updated
Dec 18, 2024
•
147
•
18
OpenGVLab/InternVideo2-Chat-8B
Video-Text-to-Text
•
8B
•
Updated
Oct 10, 2024
•
320
•
26
zai-org/cogvlm2-video-llama3-chat
Text Generation
•
13B
•
Updated
Jul 24, 2024
•
374
•
55
nyu-visionx/cambrian-34b
Text Generation
•
35B
•
Updated
Jun 28, 2024
•
21
•
27
zai-org/cogvlm-base-490-hf
Text Generation
•
18B
•
Updated
Nov 20, 2023
•
41
•
7
zai-org/cogvlm-chat-hf
Text Generation
•
18B
•
Updated
Dec 19, 2023
•
904
•
199
zai-org/cogvlm-grounding-generalist-hf
Text Generation
•
18B
•
Updated
Dec 11, 2023
•
189
•
16
Qwen/Qwen-VL
Text Generation
•
Updated
Jan 25, 2024
•
42.9k
•
277
liuhaotian/llava-v1.5-7b
Image-Text-to-Text
•
Updated
May 8, 2024
•
171k
•
549
LanguageBind/MoE-LLaVA-Phi2-2.7B-4e-384
Text Generation
•
6B
•
Updated
Feb 1, 2024
•
18
•
32
LanguageBind/Video-LLaVA-7B-hf
Image-Text-to-Text
•
7B
•
Updated
May 16, 2024
•
13.2k
•
50
openvla/openvla-7b-prismatic
Image-Text-to-Text
•
Updated
Jul 9, 2024
•
78
•
6
openvla/openvla-7b-finetuned-libero-object
Image-Text-to-Text
•
8B
•
Updated
Oct 9, 2024
•
10.6k
•
1
openvla/openvla-7b-finetuned-libero-10
Image-Text-to-Text
•
8B
•
Updated
Oct 9, 2024
•
5.55k
•
5
IntelLabs/LlavaOLMoBitnet1B
Updated
Aug 30, 2024
•
161
•
30
mistral-community/pixtral-12b-240910
Image-Text-to-Text
•
Updated
Oct 1, 2024
•
6.44k
•
381
LanguageBind/MoE-LLaVA-StableLM-1.6B-4e
Text Generation
•
3B
•
Updated
Feb 1, 2024
•
48
•
8
llava-hf/LLaVA-NeXT-Video-7B-hf
Video-Text-to-Text
•
7B
•
Updated
Nov 11, 2025
•
92.8k
•
121
Qwen/Qwen-VL-Chat
Text Generation
•
Updated
Jan 25, 2024
•
70.7k
•
383
LanguageBind/Video-LLaVA-7B
Text Generation
•
7B
•
Updated
Apr 9, 2024
•
1.52k
•
89
LanguageBind/LanguageBind_Image
Zero-Shot Image Classification
•
Updated
Feb 1, 2024
•
18.5k
•
11
LanguageBind/LanguageBind_Video
Zero-Shot Image Classification
•
Updated
Feb 1, 2024
•
3.92k
•
3
llava-hf/llava-1.5-13b-hf
Image-Text-to-Text
•
13B
•
Updated
Jan 27, 2025
•
10k
•
34
llava-hf/llava-1.5-7b-hf
Image-Text-to-Text
•
7B
•
Updated
Jun 6, 2025
•
2.53M
•
354
FreedomIntelligence/LongLLaVA-53B-A13B
Image-Text-to-Text
•
52B
•
Updated
Nov 28, 2024
•
38
•
20
meta-llama/Llama-3.2-11B-Vision
Image-Text-to-Text
•
11B
•
Updated
Sep 27, 2024
•
13.3k
•
586
BAAI/Emu3-VisionTokenizer
Feature Extraction
•
0.3B
•
Updated
Oct 8, 2024
•
7.95k
•
62
openbmb/MiniCPM-V-2_6
Image-Text-to-Text
•
Updated
Jun 13, 2025
•
152k
•
1.04k
openbmb/MiniCPM-V
Visual Question Answering
•
3B
•
Updated
Jan 15, 2025
•
1.25k
•
200
openbmb/MiniCPM-V-2
Visual Question Answering
•
3B
•
Updated
Jan 15, 2025
•
73.9k
•
495
openbmb/MiniCPM-Llama3-V-2_5
Image-Text-to-Text
•
9B
•
Updated
Jan 15, 2025
•
60.3k
•
1.41k
nvidia/NVLM-D-72B
Image-Text-to-Text
•
Updated
Jan 14, 2025
•
61.7k
•
775
vikhyatk/moondream2
Image-Text-to-Text
•
2B
•
Updated
Sep 23, 2025
•
2.42M
•
1.4k
allenai/Molmo-72B-0924
Image-Text-to-Text
•
73B
•
Updated
Oct 9, 2025
•
5.45k
•
298
allenai/MolmoE-1B-0924
Image-Text-to-Text
•
Updated
Apr 24, 2025
•
1.02k
•
157
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
8B
•
Updated
Dec 15, 2025
•
18.6k
•
565
allenai/Molmo-7B-O-0924
Image-Text-to-Text
•
8B
•
Updated
Oct 9, 2025
•
1.37k
•
163
deepseek-ai/Janus-1.3B
Any-to-Any
•
2B
•
Updated
Jan 27, 2025
•
4.1k
•
595
neulab/Pangea-7B
8B
•
Updated
Oct 24, 2024
•
370
•
133
neulab/Pangea-7B-hf
8B
•
Updated
Oct 28, 2025
•
128
•
13
BAAI/Aquila-VL-2B-llava-qwen
Visual Question Answering
•
Updated
Nov 25, 2024
•
190
•
61
mistralai/Pixtral-Large-Instruct-2411
Updated
Jul 28, 2025
•
144
•
433
google/paligemma2-10b-pt-224
Image-Text-to-Text
•
10B
•
Updated
Dec 5, 2024
•
765
•
8
google/paligemma2-3b-pt-224
Image-Text-to-Text
•
Updated
Dec 5, 2024
•
19.1k
•
167
vidore/colqwen2-v1.0
Visual Document Retrieval
•
Updated
Jun 5, 2025
•
46.8k
•
116
deepseek-ai/Janus-Pro-7B
Any-to-Any
•
Updated
Feb 1, 2025
•
65.4k
•
3.57k
deepseek-ai/Janus-Pro-1B
Any-to-Any
•
Updated
Feb 1, 2025
•
15.3k
•
474
nvidia/Eagle2-9B
Image-Text-to-Text
•
9B
•
Updated
Jan 28, 2025
•
347
•
63
openbmb/MiniCPM-o-2_6
Any-to-Any
•
9B
•
Updated
Oct 5, 2025
•
116k
•
1.29k
DAMO-NLP-SG/VideoLLaMA3-7B
Video-Text-to-Text
•
8B
•
Updated
Sep 2, 2025
•
79.7k
•
75
DAMO-NLP-SG/VideoLLaMA3-2B
Video-Text-to-Text
•
2B
•
Updated
Sep 3, 2025
•
1.08k
•
21
AIDC-AI/Ovis2-8B
Image-Text-to-Text
•
9B
•
Updated
Aug 15, 2025
•
1.34k
•
75
Qwen/Qwen3-VL-2B-Thinking
Image-Text-to-Text
•
2B
•
Updated
Oct 20, 2025
•
65.8k
•
110
LiquidAI/LFM2-VL-3B
Image-Text-to-Text
•
3B
•
Updated
17 days ago
•
12.2k
•
133
facebook/sam3
Mask Generation
•
0.9B
•
Updated
Nov 20, 2025
•
2.05M
•
1.87k
stepfun-ai/Step3-VL-10B-FP8
Image-Text-to-Text
•
Updated
Feb 4
•
618
•
10
nvidia/llama-nemotron-colembed-vl-3b-v2
Visual Document Retrieval
•
4B
•
Updated
Feb 21
•
2.54k
•
21
Upvote
-
Share collection
View history
Collection guide
Browse collections