five

UNN-LC High-Resolution Histopathological Lung Tissue Patch Dataset

收藏
DataONE2024-05-15 更新2024-10-12 收录
下载链接:
https://search.dataone.org/view/sha256:5ea1bbe66e7330a3cce1a04b1e2b7d25566ef86990310d8e45df39dd09739778
下载链接
链接失效反馈
官方服务:
资源简介:
The UNN-LC High-Resolution Histopathological Lung Tissue Patch Dataset is a collection of image patches designed for computational prognostic evaluation of lung cancer. Compiled from a subset of 194 whole-slide images (WSIs) from the University Hospital of North Norway, this dataset provides a comprehensive representation of various lung tissue conditions. Each 768 x 768 pixel patch contributes to a detailed analysis of tissue morphology. The dataset was annotated by an oncologist (Thomas Kilvær) and a pathologist (Stig Dalen) with a concerted effort to minimize selection and labeling biases. Specifically, patches with predominantly cancer cells, including tumor-infiltrating lymphocytes, were annotated by Stig Dalen. Thomas Kilvær provided annotations for patches representing normal lung tissue. The combined efforts of Stig Dalen and Thomas Kilvær resulted in the annotations for the reactive stroma with tertiary lymphoid structures and necrosis areas data. Annotations were acquired using QuPath software and a custom-developed annotation tool. The dataset categorizes patches into four classes: necrosis, tumor, stroma_tls, and normal_lung. The necrosis class includes patches of tissue associated with tumor regions, while the normal lung class represents areas of healthy lung tissue, inclusive of stromal components. The stroma_tls class is characterized by patches of reactive stroma with dense tissue and lymphocyte aggregates. The tumor tissue class comprises patches with a predominant presence of tumor content and may also include areas with tumor-infiltrating lymphocytes (TILs). For those interested in further expanding the scope and improving the balance of classes within the dataset, additional patches from the LC25000 dataset can be integrated for a more diverse representation of tissue conditions. This approach can enhance the robustness of computational models developed using this data. The dataset is divided into training and testing sets to facilitate and promote reproducibility in the development and validation of vision models. The training set includes a selection of patches from each class, while the testing set is composed of the remaining patches to ensure a comprehensive assessment of model performance.
创建时间:
2024-09-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作