ChuGyouk/KorMedMCQA_edited
收藏Hugging Face2024-07-10 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/ChuGyouk/KorMedMCQA_edited
下载链接
链接失效反馈官方服务:
资源简介:
KorMedMCQA(Edited Version)数据集来源于韩国健康人员执照考试,包括医生、护士、药剂师和牙医的考试题目。数据集的变化包括增加了牙医的考试题目和2024年的测试题目。数据集的语言为韩语,主要任务是问答。数据集的统计信息显示了每个类别的训练、开发和测试问题的数量。数据字段包括主题、年份、考试周期、问题编号、问题、五个答案选项和正确答案。
The KorMedMCQA (Edited Version) dataset is a Korean medical dataset designed for question-answering tasks, containing exam questions for four categories: doctor, nurse, pharmacist, and dentist. The dataset is sourced from the Korea Health Personnel Licensing Examination Institute, with each category having train, development, and test data. The dataset includes multiple fields such as subject, year, examination period, question number, question, answer choices, and correct answer. The datasets statistics show the number of questions for each category, and the dataset has undergone initial proofreading.
提供机构:
ChuGyouk
原始信息汇总
KorMedMCQA (Edited Version)
数据集详情
变更内容
- 添加牙医执业考试数据:使用2021年的问题集作为开发集,2022/2023/2024年的问题集作为测试集。开发集中仅包含5条数据,可能用于小样本学习。总共添加了816条数据。
- 为医生、护士和药剂师添加2024年的测试集问题:分别为医生、护士和药剂师添加了150、291和271条测试数据。
语言
韩语
子任务
python from datasets import load_dataset doctor = load_dataset(path = "ChuGyouk/KorMedMCQA_edited", name = "doctor") nurse = load_dataset(path = "ChuGyouk/KorMedMCQA_edited", name = "nurse") pharmacist = load_dataset(path = "ChuGyouk/KorMedMCQA_edited", name = "pharm") dentist = load_dataset(path = "ChuGyouk/KorMedMCQA_edited", name = "dentist")
统计数据
| 类别 | 问题数量 (训练/开发/测试) |
|---|---|
| 医生 | 2,489 (1,890/164/435) |
| 护士 | 1,751 (582/291/878) |
| 药剂师 | 1,817 (632/300/885) |
| 牙医 | 816 (0/5/811) |
数据字段
subject: 医生、护士、药剂师或牙医year: 考试年份period: 考试周期q_number: 考试问题编号question: 问题A: 第一个答案选项B: 第二个答案选项C: 第三个答案选项D: 第四个答案选项E: 第五个答案选项answer: 答案 (1到5)。1表示答案A,5表示答案E



