five

CXR-LT: Multi-Label Long-Tailed Classification on Chest X-Rays

收藏
DataCite Commons2025-03-19 更新2024-07-13 收录
下载链接:
https://physionet.org/content/cxr-lt-iccv-workshop-cvamd/
下载链接
链接失效反馈
官方服务:
资源简介:
Chest radiography presents a "long-tailed" distribution of findings, where a few diseases are common, but most are rare. Diagnosis is further complicated by its multi-label nature, as patients often exhibit multiple co-occurring findings. While recent research has attempted to address the long-tailed medical image classification problem, the interplay between class imbalance and label co-occurrence remains underexplored. The **CXR-LT 2024 challenge** builds on the success of CXR-LT 2023, expanding the dataset of 377,110 chest X-rays (CXRs) to 45 disease labels, including 19 new rare disease findings. This year 's challenge introduces three tasks: (i) long-tailed classification on a large, noisy test set, (ii) long-tailed classification on a manually annotated "gold standard" subset, and (iii) zero-shot generalization to five previously unseen disease findings. CXR-LT 2024 addresses critical challenges in long-tailed, multi-label, and zero-shot learning for medical imaging by synthesizing state-of-the-art solutions from the international research community. Further, our dataset contributions -- expanding disease coverage to better reflect real-world clinical settings -- offer a valuable resource for future research. This project contains labels from the CXR-LT 2024 and CXR-LT 2023 challenges, as well as a related subset used in the MICCAI 2023 paper, "How Does Pruning Impact Multi-Label Long-Tailed Learning?"

胸部放射影像(Chest radiography)的检出结果呈现“长尾(long-tailed)分布”:少数疾病较为常见,而绝大多数病症则属于罕见范畴。由于患者往往同时存在多种并发检出结果,胸部影像诊断的多标签(multi-label)特性进一步加剧了任务复杂度。尽管现有研究已尝试解决长尾医学图像分类问题,但类别不平衡与标签共现之间的交互机制仍未得到充分探索。**CXR-LT 2024挑战赛**承接CXR-LT 2023的成功经验,将原有的377,110张胸部X线(Chest X-rays, CXR)数据集拓展至45种疾病标签,其中包含19种新增的罕见病症检出结果。本届挑战赛设置三项任务:(i) 基于大规模带噪测试集的长尾分类任务;(ii) 基于人工标注“金标准”子集的长尾分类任务;(iii) 针对5种未见过的全新病症的零样本(Zero-shot)泛化任务。CXR-LT 2024挑战赛通过整合国际研究社群的前沿解决方案,针对性解决医学影像领域中长尾、多标签以及零样本学习的核心挑战。此外,本数据集通过拓展疾病覆盖范围以更贴合真实临床场景,为后续研究提供了极具价值的研究资源。本项目包含CXR-LT 2024与CXR-LT 2023挑战赛的标签数据,以及MICCAI 2023论文《剪枝如何影响多标签长尾学习?》中使用的相关子集。
提供机构:
PhysioNet
创建时间:
2023-06-11
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作