CXR-PRO: MIMIC-CXR with Prior References Omitted
收藏physionet.org2025-01-21 收录
下载链接:
https://physionet.org/content/cxr-pro/1.0.0/
下载链接
链接失效反馈官方服务:
资源简介:
CXR-PRO is an adaptation of the MIMIC-CXR dataset that omits references to prior radiology reports. Consisting of 374,139 free-text radiology reports and associated chest radiographs, CXR-PRO addresses the issue of hallucinated references to priors produced by radiology report generation models. By removing nearly all prior references in MIMIC-CXR, CXR-PRO, when used as training data for report generation models, is capable of broadly improving the factual consistency and accuracy of generated reports. More generally, this dataset aims to support a wide body of research in medical image analysis and natural language processing. MIMIC-CXR is a de-identified dataset, so no protected health information (PHI) is included.
CXR-PRO是MIMIC-CXR数据集的改编版本,其中剔除了对先前放射学报告的引用。该数据集包含374,139份自由文本形式的放射学报告及其相应的胸部X光片,旨在解决由放射学报告生成模型产生的对先前信息的幻觉引用问题。通过几乎完全去除MIMIC-CXR中的先前引用,CXR-PRO在作为报告生成模型的训练数据时,能够显著提升生成报告的事实一致性和准确性。更广泛地说,本数据集旨在支持医学图像分析和自然语言处理领域的一系列研究。MIMIC-CXR为一去识别化数据集,因此不包含受保护的个人信息(PHI)。
提供机构:
physionet.org



