Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
falamarcao 's Collections
Testing 1.. 2… 3…
AI Agent
Veterinary
On Device (local)
Start Here
omni models (text, image, audio, video)
Speech related
Web GPU
Software Engineering
Tracker
Speech-to-speech
MCP Servers
computer-use
Speech-to-text
Index-embed
3D
Code
Object Detection
Safety
Parser
Multimodal
Specialized
OCR
Video
Image
Audio
LLM
Text-to-speech

omni models (text, image, audio, video)

updated Sep 22, 2025
Upvote
-

  • Running
    Agents
    Featured
    262

    Qwen3 Omni Demo

    ⚡
    262

    Chat with multimodal AI using text, audio, images, and video


  • Running
    Agents
    61

    Qwen3 Omni Captioner Demo

    🐠
    61

    Generate captions from audio


  • Qwen/Qwen3-Omni-30B-A3B-Thinking

    Any-to-Any • 32B • Updated Sep 22, 2025 • 28.7k • 299

  • Qwen/Qwen3-Omni-30B-A3B-Instruct

    Any-to-Any • 35B • Updated Sep 22, 2025 • 421k • 911

  • Qwen/Qwen3-Omni-30B-A3B-Captioner

    Any-to-Any • 32B • Updated Sep 22, 2025 • 6.55k • 221
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs