five

HISTOPANTUME: Histological Pan-cancer Tumor image dataset

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14555793
下载链接
链接失效反馈
官方服务:
资源简介:
HISTOPANTUM is a comprehensive pan-cancer dataset of 140,569 histology images categorized into Tumor and Non-Tumor classes over 4 different cancer types (domains). This dataset is designed to facilitate domain generalization analysis for tumor detection tasks, serving as a benchmark for foundation models and domain generalization algorithms. Dataset Overview The dataset comprises histology images sourced from The Cancer Genome Atlas (TCGA), spanning the following four cancer types: Colorectal Cancer Ovarian Cancer Stomach Cancer Uterus Cancer Image Specifications Original Resolution: 512 × 512 pixels images are extracted from 0.5 micron-per-pixel resolution. Processed Size: Images are resized to 224 × 224 pixels and saved as JPEG files. The dataset is provided in four zipped files, each corresponding to one cancer type. Within each zip file, images are organized into two subfolders: tumour non-tumour Each image filename encodes the originating slide and the patch position within the slide, following this naming convention: __.jpg Citation If you use this dataset in your research, please cite the following publication: @article{zamanitajeddin2024benchmarking, title={Benchmarking Domain Generalization Algorithms in Computational Pathology}, author={Zamanitajeddin, Neda and Jahanifar, Mostafa and Xu, Kesi and Siraj, Fouzia and Rajpoot, Nasir}, journal={arXiv preprint arXiv:2409.17063}, year={2024} } For further details, please refer to the linked publication.
创建时间:
2024-12-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作