Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Building on HF
266.4
TFLOPS
686
1312
4965
Victor Mustar
PRO
victor
Follow
weifei's profile picture
dloader's profile picture
OMCLab's profile picture
5,445 followers
·
1,723 following
victormustar
AI & ML interests
Building the UX of this website
Recent Activity
reacted
to
Juanxi
's
post
with 🔥
about 7 hours ago
📢 Awesome Multimodal Modeling We introduce Awesome Multimodal Modeling, a curated repository tracing the architectural evolution of multimodal intelligence—from foundational fusion to native omni-models. 🔹 Taxonomy & Evolution: Traditional Multimodal Learning – Foundational work on representation, fusion, and alignment. Multimodal LLMs (MLLMs) – Architectures connecting vision encoders to LLMs for understanding. Unified Multimodal Models (UMMs) – Models unifying Understanding + Generation via Diffusion, Autoregressive, or Hybrid paradigms. Native Multimodal Models (NMMs) – Models trained from scratch on all modalities; contrasts early vs. late fusion under scaling laws. 💡 Key Distinction: UMMs unify tasks via generation heads; NMMs enforce interleaving through joint pre-training. 🔗 Explore & Contribute: https://github.com/OpenEnvision-Lab/Awesome-Multimodal-Modeling
liked
a model
about 10 hours ago
MiniMaxAI/MiniMax-M2.7
liked
a Space
about 23 hours ago
manasha2006/FoodCrisisEnv
View all activity
Organizations
victor
's buckets
9
victor/snapshots
3.18 MB
victor/qwen35-test-results
648 kB
victor/qwen35-test-scripts
47.8 kB
victor/autotrain-japanese-qwen35-2b
10.8 kB
victor/training-artifacts-v2
5.22 MB
victor/misc
175 kB
victor/test2323
0 Bytes
victor/caca2
0 Bytes
victor/hello
93 Bytes