
PathOlOgics_RBCs

Source: DataCite Commons. Updated 2025-06-01. Indexed 2024-08-18.
Download link:
https://figshare.com/articles/dataset/PathOlOgics_RBCs/24116127/1
Description:
The PathOlOgics_RBCs dataset is organized into two root directories: "PathOlOgics_RBCs Patches" and "PathOlOgics_RBCs Cells".

The "PathOlOgics_RBCs Patches" directory contains 25 folders, one per slide/smear, holding a total of 47,366 source images/patches. The patch counts for the 25 slide/smear folders are, in order: 1690, 1709, 2697, 2403, 1905, 1803, 2964, 3049, 2464, 2080, 1894, 1852, 2590, 2263, 3328, 983, 1066, 935, 1131, 1237, 1874, 1393, 1277, 1199, and 1580. Each patch name has two components: the corresponding slide/smear number followed by the unique patch number. The slides/smears come from four staining and scanning sources: the first source comprises slides 1, 5, 6, 8, and 25; the second source includes slides 2, 3, 4, 7, 9, and 11; the third source encompasses slides 14, 15, 19, 20, 22, 23, and 24; and the fourth source consists of slides 10, 12, 13, 16, 17, 18, and 21.

The second root directory, "PathOlOgics_RBCs Cells", contains three primary folders: "Cropped images", "Masks", and "Segmented images". Each primary folder has nine subfolders, one per RBC class, with the following cell counts: Angled cells: 24,187; Borderline ovalocytes: 35,540; Burr cells: 8,948; Fragmented RBCs: 7,186; Ovalocytes: 55,348; Rounded RBCs: 46,346; Teardrops: 16,298; Three-overlapping RBCs: 15,577; and Two-overlapping RBCs: 31,360. Each of the 240,790 cells is represented by its own cropped image, mask, and segmented image, all processed and individually reviewed by the hematologists directly. Samples for every class were collected from each slide/smear.
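The figures in the description can be cross-checked with a few lines of Python. The following is a minimal sketch, not part of the dataset: the constant and function names (`PATCH_COUNTS`, `CELL_COUNTS`, `SOURCE_SLIDES`, `slide_to_source`, `patches_per_source`) are illustrative assumptions, and the numbers are copied from the description above.

```python
# Per-slide patch counts for slides 1..25, copied from the description.
PATCH_COUNTS = [
    1690, 1709, 2697, 2403, 1905, 1803, 2964, 3049, 2464, 2080,
    1894, 1852, 2590, 2263, 3328, 983, 1066, 935, 1131, 1237,
    1874, 1393, 1277, 1199, 1580,
]

# Per-class cell counts, copied from the description.
CELL_COUNTS = {
    "Angled cells": 24187,
    "Borderline ovalocytes": 35540,
    "Burr cells": 8948,
    "Fragmented RBCs": 7186,
    "Ovalocytes": 55348,
    "Rounded RBCs": 46346,
    "Teardrops": 16298,
    "Three-overlapping RBCs": 15577,
    "Two-overlapping RBCs": 31360,
}

# Staining/scanning source -> slide numbers, copied from the description.
SOURCE_SLIDES = {
    1: [1, 5, 6, 8, 25],
    2: [2, 3, 4, 7, 9, 11],
    3: [14, 15, 19, 20, 22, 23, 24],
    4: [10, 12, 13, 16, 17, 18, 21],
}


def slide_to_source(source_slides):
    """Invert the source -> slides table, rejecting duplicate slides."""
    mapping = {}
    for source, slides in source_slides.items():
        for slide in slides:
            if slide in mapping:
                raise ValueError(f"slide {slide} listed under two sources")
            mapping[slide] = source
    return mapping


def patches_per_source(patch_counts, source_slides):
    """Sum the per-slide patch counts within each source."""
    lookup = slide_to_source(source_slides)
    totals = {source: 0 for source in source_slides}
    for slide, count in enumerate(patch_counts, start=1):
        totals[lookup[slide]] += count
    return totals


if __name__ == "__main__":
    print("total patches:", sum(PATCH_COUNTS))
    print("total cells:", sum(CELL_COUNTS.values()))
    print("patches per source:", patches_per_source(PATCH_COUNTS, SOURCE_SLIDES))
```

A grouping like `patches_per_source` may be useful when splitting data by staining and scanning source, for example to hold out one source for evaluation.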
The cropped image, mask, and segmented image of every cell share a consistent naming format: the slide/smear number, followed by the unique patch number, followed by XYWH notation specifying the cell's position on the patch. All images are stored in ".jpg" format.
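The naming scheme above could be parsed roughly as follows. This is a hedged sketch: the description does not state the separator or the exact layout of a file name, so the code assumes that the six fields appear as integers in the order slide, patch, X, Y, W, H, separated by any non-digit characters. The `CellName` type, the `parse_cell_name` function, and the example file name are hypothetical and should be checked against the actual files.

```python
import re
from pathlib import PurePath
from typing import NamedTuple


class CellName(NamedTuple):
    """Fields encoded in a cell file name (assumed order, see lead-in)."""
    slide: int  # slide/smear number
    patch: int  # unique patch number
    x: int      # X position of the cell on the patch
    y: int      # Y position of the cell on the patch
    w: int      # width of the cell crop
    h: int      # height of the cell crop


def parse_cell_name(filename: str) -> CellName:
    """Extract slide, patch, and XYWH from a cell image file name.

    Assumption: the stem holds exactly six integer runs, in the order
    slide, patch, X, Y, W, H. The real separator is not documented.
    """
    stem = PurePath(filename).stem  # drop the directory and ".jpg" suffix
    numbers = [int(n) for n in re.findall(r"\d+", stem)]
    if len(numbers) != 6:
        raise ValueError(f"expected 6 numeric fields, got {len(numbers)}: {filename!r}")
    return CellName(*numbers)


if __name__ == "__main__":
    # Hypothetical file name, used only to illustrate the assumed layout.
    print(parse_cell_name("3_120_45_60_32_32.jpg"))
```

Because the same name is used for the cropped image, mask, and segmented image of a cell, a parsed name could also serve as a key for matching the three files across the "Cropped images", "Masks", and "Segmented images" folders.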

Provider:
figshare
Created:
2023-09-10
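Dataset overview
The PathOlOgics_RBCs dataset contains 47,366 red blood cell image patches and 240,790 classified and labeled red blood cell images, and is suited to hematology research and red blood cell morphology analysis. The dataset is clearly structured, with detailed naming and classification information that supports research use.
The summary above was collected and generated by 遇见数据集.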