naver-clova-ix/donut-base
Image-to-Text • Updated • 194k • 252
Audio Conditioned LipSync with Latent Diffusion Models
Generate story images from text and optional reference photos
Import a portrait, click to move the head!
Edit photos with scribbles and AI-driven color changes
Line Art Colorization with Precise Reference Following
Track, rank and evaluate open LLMs and chatbots
Explore and submit LLM benchmarks
Transcribe audio files into text
ALA