sameedkhan/medconceptsqa-sample_medarc_2k
Source: Hugging Face. Updated 2025-11-05; indexed 2025-11-15.
Download link:
https://hf-mirror.com/datasets/sameedkhan/medconceptsqa-sample_medarc_2k
Description:
MedARC 2K per Split Subsample is a hierarchically stratified sample of the ICD10-CM coding system drawn from MedConceptsQA. The sampling aims for maximum coverage across all levels of the code hierarchy so that the subsample is as representative as possible. The dataset contains questions and answers, with training and test data at three difficulty levels (easy, medium, hard). It is suited to text classification tasks in the medical domain.
Provider:
sameedkhan
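
The description mentions stratified sampling for coverage across the code hierarchy. The listing does not say how the subsample was actually built, so the following is only a minimal sketch of the general idea, simplified to a single hierarchy level. The function name `stratified_sample`, the `prefix_len` parameter, and the example code strings are hypothetical and are not taken from the dataset or its authors.

```python
import random
from collections import defaultdict


def stratified_sample(codes, k, prefix_len=3, seed=0):
    """Pick up to k codes, spreading picks across strata.

    Assumption: a stratum is the set of codes sharing the first
    `prefix_len` characters (roughly an ICD-10-CM category).
    """
    rng = random.Random(seed)

    # Group codes by prefix.
    strata = defaultdict(list)
    for code in codes:
        strata[code[:prefix_len]].append(code)

    # Shuffle inside each stratum so the pick within a group is random.
    for group in strata.values():
        rng.shuffle(group)

    picked = []
    keys = sorted(strata)
    # Round-robin: each pass takes at most one code per stratum, so every
    # stratum is covered before any stratum contributes a second code.
    while len(picked) < k and keys:
        remaining = []
        for key in keys:
            if len(picked) >= k:
                break
            picked.append(strata[key].pop())
            if strata[key]:
                remaining.append(key)
        keys = remaining
    return picked


# Illustrative code strings only, not drawn from the dataset.
codes = ["A00.0", "A00.1", "A00.9", "A01.0", "B20", "E11.9", "E11.65"]
sample = stratified_sample(codes, k=4)
```

With four strata and `k=4`, the first round-robin pass takes one code from each prefix, so the sample covers every category in the input. A truly hierarchical variant would repeat the same idea at several prefix lengths.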



