Diabetes Question-Answering Model Evaluation Test Set
收藏DataCite Commons2025-12-05 更新2026-05-05 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=49c33ac68fcb4c1ba6e9776ce6432026
下载链接
链接失效反馈官方服务:
资源简介:
This test set was curated through a systematic selection process based on the CMtMedQA dataset (https://huggingface.co/datasets/Suprit/CMtMedQA). CMtMedQA, provided by Suprit, contains approximately 70,000 authentic multi-turn doctor–patient dialogues across 14 clinical departments and is widely used as a foundational resource in Chinese medical dialogue research.To construct the present test set, candidate samples were first retrieved from the original CMtMedQA dialogues using domain-specific keywords such as “diabetes,” “endocrinology,” and “blood glucose,” yielding a subset relevant to chronic disease scenarios. Subsequently, in accordance with the requirements of multi-turn interaction, only dialogues in which the patient contributed more than three turns were retained. All candidate dialogues underwent case-by-case verification by clinical experts, who reviewed the medical topics, content completeness, and linguistic accuracy. After this expert validation process, a total of 100 authentic multi-turn patient dialogues were selected to form the released test set, providing data support for question-answering tasks related to chronic disease management.
提供机构:
Science Data Bank
创建时间:
2025-12-05



