5 9 73

Wenbo Hu

gordonhu

https://gordonhu608.github.io/

AI & ML interests

None yet

Recent Activity

liked a dataset about 19 hours ago

OpenSeeker/OpenSeeker-v1-Data

liked a dataset about 19 hours ago

zlab-princeton/Vero-600k

authored a paper 6 days ago

BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions

View all activity

Organizations

upvoted a paper 6 days ago

OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks

Paper • 2604.08539 • Published 7 days ago • 48

upvoted 2 papers 4 months ago

MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence

Paper • 2512.10863 • Published Dec 11, 2025 • 22

MotionEdit: Benchmarking and Learning Motion-Centric Image Editing

Paper • 2512.10284 • Published Dec 11, 2025 • 26

upvoted a paper 5 months ago

G^2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Paper • 2511.21688 • Published Nov 26, 2025 • 8

upvoted a collection 11 months ago

Perception Encoder

Collection

16 items • Updated Mar 2 • 80

upvoted an article almost 2 years ago

Article

Mixture of Experts Explained

Dec 11, 2023

•

1.11k

upvoted 2 collections almost 2 years ago

Model Checkpoints in the ExPO Paper

Collection

15 items • Updated May 19, 2024 • 2

Model Extrapolation Expedites Alignment

Collection

Better aligned models obtained by model extrapolation (ExPO) • 23 items • Updated Mar 2 • 17

upvoted a paper over 2 years ago

Vision Transformers Need Registers

Paper • 2309.16588 • Published Sep 28, 2023 • 86

Wenbo Hu

AI & ML interests

Recent Activity

Organizations

gordonhu's activity

Mixture of Experts Explained