【我遇到的问题】 • 现象:该数据集的下载链接已失效 【相关信息】 • 可考虑访问这个链接获取类似文件~https://www.selectdataset.com/dataset/3688356173feccbcf1f1e490ddc6bc72
Elsafty_RBCs_for_Classification
收藏DataCite Commons2025-06-01 更新2024-08-19 收录
下载链接:
https://figshare.com/articles/dataset/Elsafty_RBCs_for_Classification/25599981/1
下载链接
链接失效反馈官方服务:
资源简介:
This Elsafty_RBCs_for_Classification dataset is systematically structured into three primary folders: "Cropped images”, "Masks”, and "Segmented images”. Within each of these primary folders, there are nine subfolders, meticulously dedicated to each RBCs class, encompassing the following counts of cells: "Angled cells: 24,187", "Borderline ovalocytes: 35,540”, "Burr cells: 8,948”, "Fragmented RBCs: 7,186”, "Ovalocytes: 55,073”, "Rounded RBCs: 46,338”, "Teardrops: 16,298”, "Three-overlapping RBCs: 15,577”, and "Two-overlapping RBCs: 31,360”. Each of the total 240,507 cells is represented by its own cropped image, mask, and segmented image. Samples for every class were collected from each slide/patient. The naming scheme for the cropped image, mask, and segmented image of every cell adheres to a consistent format, starting with the slide/patient number, followed by the unique patch/field number, and concluding with the (XYWH) coordination on the patch. All these images are conveniently stored in the lossless ".PNG" format.The presence of fragmented RBCs or teardrop-shaped RBCs is medically significant as it is commonly associated with serious medical conditions. Fragmented RBCs are defined as RBCs that are smaller than half the average normal/rounded RBCs size and/or irregularly shaped fragments with sharp, angular, or jagged edges. Identifying these cells is the most reliable indicator to confirm the diagnosis of diseases such as hemolytic anemias, thrombotic thrombocytopenic purpura (TTP), and disseminated intravascular coagulation (DIC). However, reporting fragmented RBCs in TTP and DIC can be a challenge due to their infrequency in hematology labs; furthermore, the cutoff for significant presence in these two serious diseases is just above 1–1.5% of the total RBCs, increasing the risk of overlooking them. Crucially, in cases of critical thrombocytopenia where the platelet count is less than 20 K/µL, platelet transfusion may be necessary, but this intervention can be life-threatening in TTP and DIC. Therefore, identifying and counting fragmented RBCs could be critical for the accurate diagnosis and management of patients with associated medical conditions.Increased teardrop-shaped RBCs above 2–4% in adults can be indicative of bone marrow fibrosis caused by bone marrow cancers, and in non-cancerous conditions, rushed erythropoiesis/production of blood to compensate for severe anemia is the differential diagnosis. Currently, manual or DL-based visual examination is the only way to identify teardrop-shaped RBCs. It is essential to differentiate between true teardrop-shaped RBCs, which have a single blunt protrusion, and false ones that have sharp surface projections without necks or have more than one blunt protrusion. Mechanical stress during blood smear preparation often leads to the formation of false teardrop shapes, primarily at the outer edges of the blood film.Ovalocytes are a type of RBCs that have an abnormal oval shape. The presence of ovalocytes exceeding 5–10% of the total RBCs is associated with almost all types of anemia or erythrocytosis. They may display elongation and/or a pear shape, but without any blunt or sharp surface protrusions. Occasionally, they can also appear in normal blood smears due to mechanical deformation during preparation, though at a low frequency.The burr cells have uneven surfaces with several small notches and protrusions. Likewise, no technological substitute currently exists for the visual recognition of burr cells, which tend to elevate under conditions of dehydration, such as in cases of renal failure or dehydrated neonates. Alternatively, in situations lacking medical justification, the presence of burr cells may arise due to the extended drying of smears during the manual staining procedure.
Elsafty_RBCs_for_Classification 数据集采用系统化架构,分为三个核心文件夹:"裁剪图像(Cropped images)"、"掩码(Masks)"与"分割图像(Segmented images)"。每个主文件夹下均包含9个子文件夹,分别对应每一类红细胞(Red Blood Cells, RBCs),各类细胞的数量如下:成角红细胞(Angled cells):24187个,边界椭圆形红细胞(Borderline ovalocytes):35540个,刺状红细胞(Burr cells):8948个,破碎红细胞(Fragmented RBCs):7186个,椭圆形红细胞(Ovalocytes):55073个,圆形红细胞(Rounded RBCs):46338个,泪滴形红细胞(Teardrops):16298个,三重重叠红细胞(Three-overlapping RBCs):15577个,双重重叠红细胞(Two-overlapping RBCs):31360个。本数据集总计包含240507个细胞,每个细胞均对应一张专属的裁剪图像、掩码与分割图像。
所有类别的样本均采集自每一张玻片/患者。每个细胞的裁剪图像、掩码及分割图像遵循统一命名格式:以玻片/患者编号作为起始标识,紧随唯一的视野(patch/field)编号,最后标注该视野内的(XYWH)坐标。所有图像均采用无损PNG(.PNG)格式存储。
破碎红细胞或泪滴形红细胞具有重要临床意义,通常与多种严重疾病相关。破碎红细胞的定义为:尺寸小于正常圆形红细胞平均尺寸一半的红细胞,和/或带有尖锐棱角、锯齿状边缘的不规则碎片。识别此类细胞是确诊溶血性贫血、血栓性血小板减少性紫癜(Thrombotic Thrombocytopenic Purpura, TTP)及弥散性血管内凝血(Disseminated Intravascular Coagulation, DIC)等疾病的最可靠指标。然而,在血液学实验室中,TTP与DIC患者的破碎红细胞检出率较低,因此报告此类细胞存在一定难度;此外,这两类重症疾病中具有临床意义的破碎红细胞占总红细胞的比例阈值仅为1%~1.5%以上,进一步提升了漏检风险。
尤为关键的是,当出现血小板计数低于20 K/µL的重症血小板减少症时,临床通常需要进行血小板输注,但在TTP与DIC患者中,该干预措施可能危及生命。因此,识别并计数破碎红细胞对于相关疾病的精准诊断与患者管理至关重要。
成人泪滴形红细胞占总红细胞比例超过2%~4%时,可提示骨髓纤维化(由骨髓恶性肿瘤引发);在非恶性疾病中,鉴别诊断需考虑为代偿严重贫血而出现的加速红细胞生成/血液生成过程。目前,识别泪滴形红细胞仅能通过人工目视检查或基于深度学习(Deep Learning, DL)的视觉检测实现。需严格区分真性泪滴形红细胞(仅带有单个钝性突起)与假性泪滴形红细胞(无颈部的尖锐表面突起,或存在多个钝性突起)。血涂片制备过程中的机械应力常导致假性泪滴形红细胞形成,此类细胞多分布于血膜的外缘区域。
椭圆形红细胞为一类形态异常的椭圆形红细胞。当椭圆形红细胞占总红细胞比例超过5%~10%时,几乎与各类贫血或红细胞增多症相关。此类细胞可表现为伸长形态和/或梨形,但无任何钝性或尖锐的表面突起。偶尔,血涂片制备过程中的机械变形也可导致其出现,但发生率较低。
刺状红细胞的表面凹凸不平,带有多个小型切迹与突起。目前同样无技术手段可替代目视识别刺状红细胞,此类细胞在脱水状态下(如肾衰竭或脱水新生儿)比例会升高。此外,若缺乏明确医学指征,手动染色过程中涂片长时间干燥也可诱导刺状红细胞的出现。
提供机构:
figshare
创建时间:
2024-04-22
搜集汇总
数据集介绍

背景与挑战
背景概述
Elsafty_RBCs_for_Classification数据集是一个包含24万多个红细胞图像的医学数据集,分为9个类别,用于红细胞的分类和形态分析。数据集提供了裁剪图像、掩码和分割图像,适用于深度学习和医学图像处理研究,尤其关注与严重血液疾病相关的红细胞形态。
以上内容由遇见数据集搜集并总结生成



