unsloth/gemma-4-E2B-it-GGUF
Image-Text-to-Text β’ 5B β’ Updated β’ 600k β’ 124
Dub videos in another language with cloned voice
Generate a talking face video from an image and audio
Explore speech recognition model benchmarks and rankings
Generate and listen to creative stories
Generate multilingual talking-face videos from your text
Generate realistic audio from text
Generate animated face images using a driving video