Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Sahu's picture
3 13

Sahu

Manishsahu53
·
https://github.com/ManishSahu53
  • ManishSahu53

AI & ML interests

None yet

Recent Activity

liked a Space 28 minutes ago
HuggingFaceTB/trl-distillation-trainer
reacted to Juanxi's post with 🔥 about 18 hours ago
📢 Awesome Multimodal Modeling We introduce Awesome Multimodal Modeling, a curated repository tracing the architectural evolution of multimodal intelligence—from foundational fusion to native omni-models. 🔹 Taxonomy & Evolution: Traditional Multimodal Learning – Foundational work on representation, fusion, and alignment. Multimodal LLMs (MLLMs) – Architectures connecting vision encoders to LLMs for understanding. Unified Multimodal Models (UMMs) – Models unifying Understanding + Generation via Diffusion, Autoregressive, or Hybrid paradigms. Native Multimodal Models (NMMs) – Models trained from scratch on all modalities; contrasts early vs. late fusion under scaling laws. 💡 Key Distinction: UMMs unify tasks via generation heads; NMMs enforce interleaving through joint pre-training. 🔗 Explore & Contribute: https://github.com/OpenEnvision/Awesome-Multimodal-Modeling
liked a model 5 days ago
black-forest-labs/FLUX.2-small-decoder
View all activity

Organizations

lorafrenzi's profile picture

Manishsahu53 's Spaces 2

Running

Fashiongenie Ai Magical Model Maker

⚡

Oct 30, 2025
Running

Cardgenius Ai Powered Perfection

🦀

Create a static web page by editing HTML

Oct 13, 2025
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs