Diabetes Question-Answering Model Evaluation Test Set

Source: Science Data Bank. Updated: 2025-11-22. Indexed: 2026-04-23.
Download link:
https://www.scidb.cn/detail?dataSetId=49c33ac68fcb4c1ba6e9776ce6432026
Resource description:
This test set was curated through a systematic selection process from the CMtMedQA dataset (https://huggingface.co/datasets/Suprit/CMtMedQA). CMtMedQA, provided by Suprit, contains approximately 70,000 authentic multi-turn doctor–patient dialogues across 14 clinical departments and is widely used as a foundational resource in Chinese medical dialogue research.

To construct the test set, candidate samples were first retrieved from the original CMtMedQA dialogues using domain-specific keywords such as “diabetes,” “endocrinology,” and “blood glucose,” yielding a subset relevant to chronic disease scenarios. To meet the requirements of multi-turn interaction, only dialogues in which the patient contributed more than three turns were then retained. Clinical experts verified every candidate dialogue case by case, reviewing its medical topic, content completeness, and linguistic accuracy. After this expert validation, 100 authentic multi-turn patient dialogues were selected to form the released test set, which supports question-answering tasks related to chronic disease management.
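The two automatic steps described above (keyword retrieval, then keeping dialogues with more than three patient turns) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the curators' actual pipeline: the turn schema (`role`, `text`), the function names, and the Chinese keyword equivalents are all hypothetical, and the expert verification step is manual and therefore not modelled.

```python
from typing import Dict, List

# Hypothetical schema (assumption): each turn is {"role": "patient" | "doctor", "text": "..."}.
# The real CMtMedQA field names may differ.
Turn = Dict[str, str]

# English keywords come from the description; the Chinese equivalents are an assumption.
KEYWORDS = ("diabetes", "endocrinology", "blood glucose", "糖尿病", "内分泌", "血糖")

# "More than three turns" means at least four patient turns.
MIN_PATIENT_TURNS = 4


def mentions_keyword(dialogue: List[Turn]) -> bool:
    """Return True if any turn in the dialogue contains one of the keywords."""
    text = " ".join(turn["text"].lower() for turn in dialogue)
    return any(keyword in text for keyword in KEYWORDS)


def patient_turn_count(dialogue: List[Turn]) -> int:
    """Count the turns contributed by the patient."""
    return sum(1 for turn in dialogue if turn["role"] == "patient")


def select_candidates(dialogues: List[List[Turn]]) -> List[List[Turn]]:
    """Keep keyword-matching dialogues with enough patient turns."""
    return [
        dialogue
        for dialogue in dialogues
        if mentions_keyword(dialogue)
        and patient_turn_count(dialogue) >= MIN_PATIENT_TURNS
    ]
```

The output of `select_candidates` would correspond to the candidate pool that then goes to clinical experts for case-by-case review.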
Provider:
ma ya kun
Created:
2025-11-22