Wenbo Hu

gordonhu

https://gordonhu608.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper 4 days ago

BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions

authored a paper 4 days ago

Matryoshka Query Transformer for Large Vision-Language Models

authored a paper 4 days ago

MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models

Organizations

commented a paper 5 days ago

OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks

Paper • 2604.08539 • Published 6 days ago • 46

commented a paper 5 months ago

G$^2$VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Paper • 2511.21688 • Published Nov 26, 2025 • 8

New activity in gordonhu/MQT-LLaVA almost 2 years ago

Apply for community grant: Personal project (gpu)

#1 opened almost 2 years ago by

Apply for community grant: Academic project (gpu)

#2 opened almost 2 years ago by

New activity in mlpc-lab/BLIVA_Vicuna over 2 years ago

Missing config.json

#1 opened over 2 years ago by