SLAKE: A Semantically-Labeled Knowledge-Enhanced Dataset for Medical Visual Question Answering
Abstract
SLAKE, a large bilingual dataset with semantic labels and a new structural medical knowledge base, facilitates the development and evaluation of Medical Visual Question Answering systems.
Medical visual question answering (Med-VQA) has tremendous potential in healthcare. However, the development of this technology is hindered by the lacking of publicly-available and high-quality labeled datasets for training and evaluation. In this paper, we present a large bilingual dataset, SLAKE, with comprehensive semantic labels annotated by experienced physicians and a new structural medical knowledge base for Med-VQA. Besides, SLAKE includes richer modalities and covers more human body parts than the currently available dataset. We show that SLAKE can be used to facilitate the development and evaluation of Med-VQA systems. The dataset can be downloaded from http://www.med-vqa.com/slake.
Get this paper in your agent:
hf papers read 2102.09542 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 51
Browse 51 models citing this paperDatasets citing this paper 2
Spaces citing this paper 291
Collections including this paper 0
No Collection including this paper