yes liu
zhizhou57
1 follower · 2 following
AI & ML interests
None yet
Recent Activity
reacted to Juanxi's post with 🔥 3 days ago
📢 Awesome Multimodal Modeling
We introduce Awesome Multimodal Modeling, a curated repository tracing the architectural evolution of multimodal intelligence, from foundational fusion to native omni-models.
🔹 Taxonomy & Evolution:
Traditional Multimodal Learning: Foundational work on representation, fusion, and alignment.
Multimodal LLMs (MLLMs): Architectures connecting vision encoders to LLMs for understanding.
Unified Multimodal Models (UMMs): Models unifying Understanding + Generation via Diffusion, Autoregressive, or Hybrid paradigms.
Native Multimodal Models (NMMs): Models trained from scratch on all modalities; contrasts early vs. late fusion under scaling laws.
💡 Key Distinction: UMMs unify tasks via generation heads; NMMs enforce interleaving through joint pre-training.
🔗 Explore & Contribute: https://github.com/OpenEnvision/Awesome-Multimodal-Modeling
commented on a paper 5 months ago
Beyond Multiple Choice: Verifiable OpenQA for Robust Vision-Language RFT
authored a paper 5 months ago
Beyond Multiple Choice: Verifiable OpenQA for Robust Vision-Language RFT
Organizations
zhizhou57's datasets (2)
zhizhou57/ReVeL-benchmarks • Preview • Updated Nov 20, 2025 • 7 • 1
zhizhou57/revel-datasets • Viewer • Updated Nov 20, 2025 • 19.2k • 2 • 2