RadVLM Instruction Dataset
收藏DataCite Commons2025-09-25 更新2026-05-04 收录
下载链接:
https://physionet.org/content/radvlm-instruction-dataset/1.0.0/
下载链接
链接失效反馈官方服务:
资源简介:
We release the RadVLM instruction dataset, a large-scale resource used to
train the RadVLM model on diverse radiology tasks. The dataset contains
1,115,021 image-instruction pairs spanning five task families: (i) report
generation from frontal CXRs using filtered Findings/Impression text; (ii)
abnormality classification for the standard 14 CheXpert labels; (iii) anatomy
grounding; (iv) abnormality detection and grounding; and (v) phrase grounding
from report sentences. To support interactive use, we include ~89k LLM-
generated multi-turn, multi-task conversations (~3k with spatial grounding)
derived from image-linked attributes (reports, labels, boxes). Creation
involved curating datasets from public sources, excluding lateral views,
removing prior-study references and other non-image context from reports,
fusing multi-reader annotations, and harmonizing label and coordinate formats.
The resource is intended for training CXR assistants across diverse radiology
tasks and within a conversational format.
提供机构:
PhysioNet
创建时间:
2025-09-12



