Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Wenbo Hu's picture
5 9 71

Wenbo Hu

gordonhu
HongBenXian's profile picture 21world's profile picture mohsenfayyaz's profile picture
·
https://gordonhu608.github.io/
  • gordonhu608
  • gordonhu608

AI & ML interests

None yet

Recent Activity

authored a paper 4 days ago
BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions
authored a paper 4 days ago
Matryoshka Query Transformer for Large Vision-Language Models
authored a paper 4 days ago
MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models
View all activity

Organizations

UCLA NLP's profile picture mlpc-ucsd's profile picture Intern Robotics's profile picture

commented a paper 5 days ago

OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks

Paper • 2604.08539 • Published 6 days ago • 46 •
2
commented a paper 5 months ago

G$^2$VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Paper • 2511.21688 • Published Nov 26, 2025 • 8 •
2
New activity in gordonhu/MQT-LLaVA almost 2 years ago

Apply for community grant: Personal project (gpu)

#1 opened almost 2 years ago by
gordonhu

Apply for community grant: Academic project (gpu)

3
#2 opened almost 2 years ago by
gordonhu
New activity in mlpc-lab/BLIVA_Vicuna over 2 years ago

Missing config.json

1
#1 opened over 2 years ago by
arnaudstiegler
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs