five

NODE21

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/4725880
下载链接
链接失效反馈
官方服务:
资源简介:
We provide a NODE21 public CXR training dataset. This dataset consists of 1134 frontal chest radiographs with annotated bounding boxes around nodules (1476 nodules). The images in this set are from public datasets that allow us to remix and redistribute. The images from this dataset come from the following sources: JSRT [1] PadChest [2] Chestx-ray14 [3] Open-I [4]  The annotations were taken from the data sources and checked by our chest radiologists, or they were provided by our chest radiologists. We provide both original and preprocessed version of the dataset. This dataset is named training_data.zip.  training_data.zip file contains a folder called original and preprocessed and associated CSV files metadata_preprocessed.csv and metadata_original.csv, respectively. The folder preprocessed contains 1134 chest X-ray images normalized and resized to 1024 x 1024 whereas the folder original contains images without any preprocessing, all of them were saved as mha format.  metadata_preprocessed.csv and metadata_original.csv  contain the bounding box locations associated with each image file. Additionally, for the generation track, we provide a public set of NODE21 CT patches (see luna16.zip). These are patches of nodules from CT scans. The patches are 50 x 50 x 50 mm resampled to voxels of 1 x 1 x 1 mm. The patches originate from the LUNA16 dataset [5][6]. These patches can be used to create artificial nodules in given chest radiographs.   [1] Shiraishi, J., Katsuragawa, S., Ikezoe, J., Matsumoto, T., Kobayashi, T., Komatsu, K.i., Matsui, M., Fujita, H., Kodera, Y., Doi, K., 2000. Development of a digital image database for chest radiographs with and without a lung nodule: receiver operating characteristic analysis of radiologists’ detection of pulmonary nodules. American Journal of Roentgenology 174, 71–74. doi:10.2214/ajr.174.1.1740071. [2] Bustos, A., Pertusa, A., Salinas, J.M., de la Iglesia-Vaya, M., 2020. PadChest: ´ A large chest x-ray image dataset with multi-label annotated reports. Medical Image Analysis 66, 101797. doi:10.1016/j.media.2020.101797. [3] Wang, X., Peng, Y., Lu, L., Lu, Z., Bagheri, M., Summers, R.M., 2017b. Chestx-ray8: Hospital-scale chest x-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases, in IEEE Conference on Computer Vision and Pattern Recognition, pp. 2097–2106. doi:10.1109/cvpr.2017.369. [4] Demner-Fushman, D., Antani, S., Simpson, M., Thoma, G.R., 2012. Design and Development of a Multimodal Biomedical Information Retrieval System. Journal of Computing Science and Engineering 6, 168–177. doi:10.5626/JCSE.2012.6.2.168. [5] Andrey Fedorov, Matthew Hancock, David Clunie, Mathias Brochhausen, Jonathan Bona, Justin Kirby, John Freymann, Steve Pieper, Hugo Aerts, Ron Kikinis1, Fred Prior, 2019. Standardized representation of the LIDC annotations using DICOM. The Cancer Imaging Archive. doi: 10.7937/TCIA.2018.H7UMFURQ [6] Setio et al., Validation, comparison, and combination of algorithms for automatic detection of pulmonary nodules in computed tomography images:: The LUNA16 challenge, Medical Image Analysis 42, doi:: 10.1016/j.media.2017.06.015
创建时间:
2021-10-27
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作