Extended ACI-BENCH Dataset
收藏arXiv2025-09-30 收录
下载链接:
https://doi.org/10.5281/zenodo.13308316
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个包含医疗记录的数据集,这些记录使用ICD-10代码进行了标注,并附有疾病诊断和支持性证据文本。它扩展了环境临床智能基准(ACI-BENCH)数据集。此外,该数据集的独特之处在于,它包含了ICD-10代码及其相关诊断的支持性证据文本,这是现有基准中未曾有的新特性。数据集规模包括207份临床笔记(其中184份为编码笔记,20份用于A/B测试)。其任务是对ICD-10编码和疾病诊断的提取。
This dataset is a collection of medical records annotated with ICD-10 codes, paired with disease diagnoses and supporting evidence texts. It extends the Ambient Clinical Intelligence Benchmark (ACI-BENCH) dataset. Notably, a distinctive feature of this dataset is that it incorporates supporting evidence texts for ICD-10 codes and their corresponding diagnoses, a novel attribute not present in existing benchmark datasets. The dataset contains a total of 207 clinical notes, including 184 annotated clinical notes and 20 notes designated for A/B testing. The core task of this dataset is the extraction of ICD-10 codes and disease diagnoses.



