
Lunguage: A Benchmark for Structured and Sequential Chest X-ray Interpretation

Source: DataCite Commons. Updated: 2026-01-12. Indexed: 2026-05-04.
Download link:
https://physionet.org/content/lunguage/
Description:
Radiology reports convey detailed clinical observations and capture diagnostic reasoning that evolves over time. However, existing evaluation methods are limited to single-report settings and rely on coarse metrics that fail to capture fine-grained clinical semantics and temporal dependencies. We introduce **LUNGUAGE**, a benchmark dataset of structured radiology reports that serves as a gold standard for evaluating structured reporting frameworks. It is designed to support comprehensive assessment of both single-report interpretation and longitudinal reasoning. Constructed from a subset of the MIMIC-CXR test set, LUNGUAGE comprises 1,473 chest X-ray reports from 230 patients, annotated with over 17,000 expert-verified entities and 23,000 relation-attribute pairs across 18 relation types. An additional subset of 80 sequential reports from 10 patients captures disease progression across 3 to 14 studies per patient, covering time intervals from 1 to 1,200 days. These sequential reports are annotated with over 41,000 pairwise comparisons, organized into semantically and temporally coherent groups. The dataset also includes a schema-aligned vocabulary covering diagnostic entities and attributes. All annotations were conducted and verified by board-certified radiologists, resulting in a clinically grounded resource for structured understanding and temporal reasoning in radiology.
Provider:
PhysioNet
Created:
2025-12-24