VLMsAreBlind ResultsReview
π
4
Review model results on visual tasks
Review model results on visual tasks
BugsBunny-XLarge
Generate captions and chat responses from your images
Game screenshots
Display random gameplay images with action buttons
The AniMer Demo
Detect and classify animals in images and videos
Flexible Photo Recrafting While Preserving Your Identity
Generate speech from text using a reference voice