Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Qwen
/
Qwen3-VL-30B-A3B-Instruct
like
562
Follow
Qwen
80.7k
Image-Text-to-Text
Transformers
Safetensors
qwen3_vl_moe
conversational
arxiv:
4 papers
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
11
Deploy
Use this model
how to use vllm to speed up inference?
#11
by
menglan
- opened
Jan 14
Discussion
menglan
Jan 14
can give an example of how to using vllm to speed up?
thanks.
See translation
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment