FluidInference/parakeet-realtime-eou-120m-coreml Automatic Speech Recognition β’ Updated Mar 14 β’ 15.9k β’ 4
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition β’ 6B β’ Updated Dec 10, 2025 β’ 329k β’ 1.58k
HuggingFaceTB/SmolVLM2-500M-Video-Instruct Image-Text-to-Text β’ Updated Apr 8, 2025 β’ 326k β’ 130
openai/clip-vit-large-patch14 Zero-Shot Image Classification β’ 0.4B β’ Updated Sep 15, 2023 β’ 29.3M β’ 1.99k
Runtime error Featured 272 Edit Video By Editing Text β 272 Audio-based video editing using AI-generated transcription