five

Weakly-Supervised Learning Significantly Reduces the Number of Labels Required for Intracranial Hemorrhage Detection on Head CT

收藏
NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/7363181
下载链接
链接失效反馈
官方服务:
资源简介:
Modern machine learning pipelines, in particular those based on deep learning (DL) models, require large amounts of labeled data. For classification problems, the most common learning paradigm consists of presenting labeled examples during training, thus providing \emph{strong supervision} by directly presenting examples from the different classes, e.g. positive and negative samples. As a result, the adequate training of these models demands the curation of large datasets with high-quality labels. This constitutes a major obstacle for the development of DL models in radiology---in particular for cross-sectional imaging (e.g., computed tomography [CT] scans)---where labels must come from manual annotations by expert radiologists at the image or slice-level (as opposed to the examination level, such as could be obtained using natural language processing of radiology reports).  This work studies the question of what kind of labels should be collected for the problem of intracranial hemorrhage detection in brain CT. We investigate whether image-level annotations should be preferred to examination-level ones. By framing this task as a Multiple Instance Learning (MIL) problem, and employing modern attention-based DL architectures, we analyze the degree to which different levels of supervision improve the detection performance. We find that strong supervision (learning with local image-level annotations) and weak supervision (learning with only global examination-level labels) achieve comparable performance in both examination- and image-level hemorrhage detection, as well as in hemorrhage localization at the pixel-level (explainability). Furthermore, we study this behavior as a function of the number of labels available during training. Our results suggest that local labels may not be necessary at all, drastically reducing the time and cost involved in collecting and curating datasets.
创建时间:
2022-11-26
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作