RadQA: A Question Answering Dataset to Improve Comprehension of Radiology Reports

Name: RadQA: A Question Answering Dataset to Improve Comprehension of Radiology Reports
Creator: PhysioNet
Published: 2022-12-09 12:53:05
License: 暂无描述

DataCite Commons2022-12-09 更新2025-04-16 收录

下载链接：

https://physionet.org/content/radqa/1.0.0/

下载链接

链接失效反馈

官方服务：

资源简介：

We present a radiology question answering dataset, RadQA, with 3074 questions posed against radiology reports and annotated with their corresponding answer spans (resulting in a total of 6148 question-answer evidence pairs) by physicians. The questions are manually created using the clinical referral section of the reports that take into account the actual information needs of ordering physicians and eliminate bias from seeing the answer context (and, further, organically create unanswerable questions). The answer spans are marked within the Findings and Impressions sections of a report. The dataset aims to satisfy the complex clinical requirements by including complete (yet concise) answer phrases (which are not just entities) that can span multiple lines. In published work, we conducted a thorough analysis of the proposed dataset by examining the broad categories of disagreement in annotation (providing insights on the errors made by humans) and the reasoning requirements to answer a question (uncovering the huge dependence on medical knowledge for answering the questions). In that work, the best-performing transformer language model achieved an F1 of 63.55 on the test set. However, the top human-level performance on this dataset is 90.31 (with an average human performance of 84.52), which demonstrates the challenging nature of RadQA that leaves ample scope for future method research.

提供机构：

PhysioNet

创建时间：

2022-11-18

5,000+

优质数据集

54 个

任务类型

进入经典数据集