med_qa
Source: ModelScope community. Updated 2026-05-15; indexed 2025-01-18.
Download link:
https://modelscope.cn/datasets/AI-ModelScope/med_qa
# Dataset Card for MedQA
## Dataset Description
- **Homepage:** https://github.com/jind11/MedQA
- **Pubmed:** False
- **Public:** True
- **Tasks:** QA
In this work, we present MedQA, the first free-form multiple-choice OpenQA dataset for solving medical problems,
collected from professional medical board exams. It covers three languages: English, simplified Chinese, and
traditional Chinese, and contains 12,723, 34,251, and 14,123 questions for the three languages, respectively. Together
with the question data, we also collect and release a large-scale corpus of medical textbooks from which reading
comprehension models can obtain the knowledge needed to answer the questions.
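
To make the numbers above concrete, here is a minimal Python sketch that tallies the per-language question counts stated in this card and shows how a single multiple-choice item might be rendered and scored. The record layout (`question`, `options`, `answer_idx`) and the sample question are illustrative assumptions, not taken from this card; check the actual files for the real field names before relying on them.

```python
# Question counts per language, as stated in the dataset description.
QUESTION_COUNTS = {
    "English": 12_723,
    "Simplified Chinese": 34_251,
    "Traditional Chinese": 14_123,
}


def language_shares(counts):
    """Return each language's fraction of the total number of questions."""
    total = sum(counts.values())
    return {lang: n / total for lang, n in counts.items()}


# Hypothetical record: the field names and the question text are assumptions
# made for illustration only, not a sample drawn from the dataset.
example_record = {
    "question": "Which vitamin deficiency causes scurvy?",
    "options": {
        "A": "Vitamin A",
        "B": "Vitamin B12",
        "C": "Vitamin C",
        "D": "Vitamin D",
    },
    "answer_idx": "C",
}


def format_prompt(record):
    """Render a record as a plain-text multiple-choice prompt."""
    lines = [record["question"]]
    lines += [f"{key}. {text}" for key, text in sorted(record["options"].items())]
    return "\n".join(lines)


def is_correct(record, predicted_key):
    """Compare a predicted option letter with the record's answer key."""
    return predicted_key.strip().upper() == record["answer_idx"]


if __name__ == "__main__":
    print("Total questions:", sum(QUESTION_COUNTS.values()))
    for lang, share in language_shares(QUESTION_COUNTS).items():
        print(f"{lang}: {share:.1%}")
    print(format_prompt(example_record))
    print("Prediction 'c' correct:", is_correct(example_record, "c"))
```

Scoring on the option letter rather than the answer text keeps the comparison independent of whitespace or casing differences in the option strings.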
## Citation Information
```
@article{jin2021disease,
title={What disease does this patient have? a large-scale open domain question answering dataset from medical exams},
author={Jin, Di and Pan, Eileen and Oufattole, Nassim and Weng, Wei-Hung and Fang, Hanyi and Szolovits, Peter},
journal={Applied Sciences},
volume={11},
number={14},
pages={6421},
year={2021},
publisher={MDPI}
}
```
Provider: maas
Created: 2025-01-17



