Collections of multimodal (image+text) instruction finetuning datasets tailored for visual language models like LlaVA, Fuyu, or IDEFICS.
Victor Sanh PRO
VictorSanh
AI & ML interests
None yet
Recent Activity
liked a Space 7 days ago
HuggingFaceFW/finephrase liked a dataset 2 months ago
notesbymuneeb/epstein-emails liked a model 3 months ago
moonshotai/Kimi-K2.5