Spatial-SSRL Spatial Reasoning
Spatial reasoning with vision-language models
New Ghibli EasyControl model is now released!
An Agentic Framework with Tools for Complex Reasoning
View the LMArena language model leaderboard
ElevenLabs Italian demo
Easily expand image boundaries
Upgraded to v1.0!
Add a logo to anything
Audio Conditioned LipSync with Latent Diffusion Models
Colorize grayscale photos with AI-generated captions
Generate app code from your idea
Generate new person images with swapped clothes or poses
Convert images of screens to structured elements
Fill and modify images using a mask and prompt