mistralai/Voxtral-Mini-4B-Realtime-2602 Automatic Speech Recognition • 4B • Updated Mar 11 • 864k • 817
Running on Zero Featured 1.86k Qwen3-TTS Demo 1.86k Generate speech from text with custom voice, cloning, or presets
Configuration error Featured 131 Ministral WebGPU ⚡ 131 Frontier multimodal AI, running entirely in your browser.
Running on CPU Upgrade 1.01k Open VLM Leaderboard 1.01k VLMEvalKit Evaluation Results Collection
Running on Zero MCP 405 Multimodal OCR 405 Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR
docling-project/SmolDocling-256M-preview Image-Text-to-Text • Updated Sep 17, 2025 • 50.4k • 1.61k
Running on Zero Featured 1.76k Dia 1.6B 🎯 1.76k Generate realistic dialogue from a script, using Dia!