video/image - a dbest111 Collection

dbest111's Collections

video/image

updated Jul 24, 2025

google/vit-base-patch16-224

Image Classification • 86.6M • Updated Sep 5, 2023 • 4.57M • 949
OpenGVLab/internimage_g_jointto22k_384

Image Classification • 3B • Updated Mar 25, 2025 • 19 • 1
chancharikm/qwen2.5-vl-72b-cam-motion

Video-Text-to-Text • 73B • Updated Sep 19, 2025 • 434 • 1
lmms-lab/Aero-1-Audio

Text Generation • 2B • Updated Jun 7, 2025 • 717 • 91
mipal/AVATAR

Updated Nov 3, 2025 • 36 • 1
zl2048/FAVOR

Viewer • Updated Aug 1, 2025 • 27.1k • 1.16k • 2
lmms-lab/VideoMMMU

Viewer • Updated May 5, 2025 • 900 • 3.39k • 11
moonshotai/Kimi-VL-A3B-Thinking-2506

Image-Text-to-Text • 16B • Updated Jan 30 • 49.3k • 355
lmms-lab/llava-critic-113k

Viewer • Updated Oct 5, 2024 • 113k • 937 • 28
lmms-lab/M4-Instruct-Data

Updated Jul 21, 2024 • 1.04k • 78
lmms-lab/llava-next-interleave-qwen-7b

Text Generation • 8B • Updated Jul 24, 2024 • 316 • 27
lmms-lab/LLaVA-OneVision-Data

Viewer • Updated May 24, 2025 • 3.94M • 27.5k • 234
avalab/syndicom

Viewer • Updated May 10, 2024 • 19.2k • 15
avalab/iTBLS

Viewer • Updated Jan 17, 2025 • 12.5k • 7
Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification

Paper • 2312.14378 • Published Dec 22, 2023
avalab/cTBLS_knowledge_retriever

Updated Jan 12, 2024
avalab/cTBLS_encoder

Updated Apr 27, 2023
CraftJarvis/minecraft-vla-sft

Viewer • Updated Mar 21, 2025 • 3.78M • 354 • 10
