HuggingFaceTB/SmolVLM-256M-Instruct
Image-Text-to-Text • 0.3B • Updated • 435k • 352
Collection for models & demos for even smoller SmolVLM release
Generate descriptions from images and text prompts
Generate captions for your images instantly