Worldcoin/iris-semantic-segmentation
Updated β’ 18
Video deep fake
Wan2.2 Animate
Clarity AI Upscaler Reproduction
Generate spoken audio from text using selectable voices
Generate JSON for Google's Veo3
Wan2.1-T2V-14B + Fast 4-step with NAG + Automatic Audio
Generate lip-synced videos from images and audio
Audio Conditioned LipSync with Latent Diffusion Models
Translate key frames in a video using prompts
Generate images from text prompts
Languages ru,en,zh-cn,ja,de,fr,it,pt,pl,tr,ko,nl,cs,ar,es,hu
Generate images from text prompts with FLUX.1 diffusion model
Efficient, fast, and natural text to speech with StyleTTS 2!