
Curated Dataset for COVID-19 Posterior-Anterior Chest Radiography Images (X-Rays).

Source: Mendeley Data. Updated: 2024-03-27. Indexed: 2024-06-26.
Download link:
https://data.mendeley.com/datasets/9xkhgts2s6
Description:
This is a combined, curated dataset of COVID-19 chest X-ray images obtained by collating 15 publicly available datasets, as listed under the references section. The present dataset contains 1281 COVID-19 X-rays, 3270 normal X-rays, 1656 viral pneumonia X-rays, and 3001 bacterial pneumonia X-rays.

This dataset was developed as part of the following research publication: "A deep-learning based multimodal system for Covid-19 diagnosis using breathing sounds and chest X-ray images", https://doi.org/10.1016/j.asoc.2021.107522

The collected datasets, as cited by this dataset, were combined to form an integrated repository containing a total of 4558 COVID-19 X-rays, 5403 normal X-rays, 4497 viral pneumonia X-rays, and 5768 bacterial pneumonia X-rays. Of these, 1379 COVID-19 X-rays, 1476 normal X-rays, 2690 viral pneumonia X-rays, and 2588 bacterial pneumonia X-rays were found to be duplicates, based on image similarity, and were removed.

The Inception V3 architecture was used to obtain image embeddings, followed by unsupervised learning algorithms based on cosine similarity distances. These distances were clustered and then visualized to find different categories of image defects, listed below:

1. Noise
2. Pixelated
3. Compressed
4. Medical implants
5. Washed-out image
6. Side view
7. CT (sliced) image
8. Aspect ratio distortion / cropped / zoomed
9. Rotated images
10. Images with annotations

These clusters of defective images were removed during the curation process, and the resulting refined dataset is available for download.
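The duplicate-removal idea described above can be illustrated with a minimal sketch. It assumes each image has already been turned into an embedding vector (the description names Inception V3 for this step) and flags an image as a duplicate when its cosine similarity to an already-kept image reaches a threshold. The function names, the 0.98 threshold, and the toy vectors are hypothetical; the description does not state the threshold or the clustering algorithm the authors used, so this greedy pairwise comparison is a stand-in rather than their method.

```python
import math
from typing import Dict, List, Sequence, Set


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat an all-zero embedding as similar to nothing
    return dot / (norm_a * norm_b)


def find_duplicates(
    embeddings: Dict[str, Sequence[float]],
    threshold: float = 0.98,  # hypothetical value, not taken from the source
) -> Set[str]:
    """Return names of images that nearly match an earlier-kept image."""
    kept: List[str] = []
    duplicates: Set[str] = set()
    for name in sorted(embeddings):  # sorted for a deterministic result
        vec = embeddings[name]
        if any(cosine_similarity(vec, embeddings[k]) >= threshold for k in kept):
            duplicates.add(name)
        else:
            kept.append(name)
    return duplicates


# Toy 3-dimensional vectors standing in for real Inception V3 embeddings.
toy_embeddings = {
    "a.png": [1.0, 0.0, 0.0],
    "b.png": [0.999, 0.01, 0.0],  # almost parallel to a.png
    "c.png": [0.0, 1.0, 0.0],     # orthogonal to a.png
}
print(sorted(find_duplicates(toy_embeddings)))
```

The pairwise loop is quadratic in the number of images, which is acceptable for a sketch but would likely need an approximate nearest-neighbour index or a clustering step at the scale of the integrated repository.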
Created:
2024-01-23