wzh
hg2wzh
AI & ML interests
None yet
Recent Activity
liked a model 3 days ago
Qwen/Qwen-Image-Edit-2511 upvoted a collection 3 days ago
Qwen-Image liked a model 4 days ago
zai-org/GLM-5.1Organizations
None yet
Datasets
Embedding
VLMs
-
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution
Paper • 2409.12191 • Published • 79 -
Multimodal Latent Language Modeling with Next-Token Diffusion
Paper • 2412.08635 • Published • 49 -
AIDC-AI/Ovis2-2B
Image-Text-to-Text • Updated • 342 • 60 -
DAMO-NLP-SG/VideoLLaMA3-2B
Video-Text-to-Text • 2B • Updated • 6.75k • 21
Eval-bench
Text-to-Image
Datasets
Reasoning
Embedding
CLIP series
VLMs
-
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution
Paper • 2409.12191 • Published • 79 -
Multimodal Latent Language Modeling with Next-Token Diffusion
Paper • 2412.08635 • Published • 49 -
AIDC-AI/Ovis2-2B
Image-Text-to-Text • Updated • 342 • 60 -
DAMO-NLP-SG/VideoLLaMA3-2B
Video-Text-to-Text • 2B • Updated • 6.75k • 21
LLMs