Papers
arxiv:2203.14371

MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question Answering

Published on Mar 27, 2022
Authors:

Abstract

A large-scale MCQA dataset addressing real-world medical entrance exam questions is introduced, testing models across various medical subjects and reasoning abilities.

AI-generated summary

This paper introduces MedMCQA, a new large-scale, Multiple-Choice Question Answering (MCQA) dataset designed to address real-world medical entrance exam questions. More than 194k high-quality AIIMS \& NEET PG entrance exam MCQs covering 2.4k healthcare topics and 21 medical subjects are collected with an average token length of 12.77 and high topical diversity. Each sample contains a question, correct answer(s), and other options which requires a deeper language understanding as it tests the 10+ reasoning abilities of a model across a wide range of medical subjects \& topics. A detailed explanation of the solution, along with the above information, is provided in this study.

Community

Sign up or log in to comment

Get this paper in your agent:

hf papers read 2203.14371
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 18

Browse 18 models citing this paper

Datasets citing this paper 11

Browse 11 datasets citing this paper

Spaces citing this paper 83

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.