Sa2VA Simple Demo
Dense Grounded Understanding of Images and Videos
Inpaint videos by adding masks and removing unwanted objects
Transcribe videos and generate concise summaries
ColorFlow: Retrieval-Augmented Image Sequence Colorization
Create images from text and reference photos