RadQA: A Question Answering Dataset to Improve Comprehension of Radiology Reports
收藏DataCite Commons2022-12-09 更新2025-04-16 收录
下载链接:
https://physionet.org/content/radqa/1.0.0/
下载链接
链接失效反馈官方服务:
资源简介:
We present a radiology question answering dataset, RadQA, with 3074 questions
posed against radiology reports and annotated with their corresponding answer
spans (resulting in a total of 6148 question-answer evidence pairs) by
physicians. The questions are manually created using the clinical referral
section of the reports that take into account the actual information needs of
ordering physicians and eliminate bias from seeing the answer context (and,
further, organically create unanswerable questions). The answer spans are
marked within the Findings and Impressions sections of a report. The dataset
aims to satisfy the complex clinical requirements by including complete (yet
concise) answer phrases (which are not just entities) that can span multiple
lines. In published work, we conducted a thorough analysis of the proposed
dataset by examining the broad categories of disagreement in annotation
(providing insights on the errors made by humans) and the reasoning
requirements to answer a question (uncovering the huge dependence on medical
knowledge for answering the questions). In that work, the best-performing
transformer language model achieved an F1 of 63.55 on the test set. However,
the top human-level performance on this dataset is 90.31 (with an average
human performance of 84.52), which demonstrates the challenging nature of
RadQA that leaves ample scope for future method research.
提供机构:
PhysioNet
创建时间:
2022-11-18



