Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
xiaomoguhzz
/
VisionEncoder
like
0
Transformers
Safetensors
vision-encoder
distillation
video-language
siglip2
dinov3
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
VisionEncoder
/
video_mllm_swift
210 GB
Ctrl+K
Ctrl+K
1 contributor
History:
9 commits
This model has 108 files scanned as unsafe.
Show
files
xiaomoguhzz
Upload folder using huggingface_hub
da24e2a
verified
9 days ago
s1_declip_siglip2_qwen3_1.7b
Upload folder using huggingface_hub
9 days ago
s1_siglip2_qwen3_1.7b
Upload folder using huggingface_hub
9 days ago
s2_declip_siglip2_qwen3_1.7b_10pct
Upload folder using huggingface_hub
9 days ago
s2_image_only_10pct
Upload folder using huggingface_hub
9 days ago
s2_siglip2_qwen3_1.7b_10pct
Upload folder using huggingface_hub
9 days ago