medarc/TCGA-12K-litdata
Source: Hugging Face. Updated 2025-10-27. Indexed 2026-01-03.
Download link:
https://hf-mirror.com/datasets/medarc/TCGA-12K-litdata
Description:
## Attribution
This dataset contains 224 × 224 JPEG patches from whole-slide images (WSIs) originally downloaded from The Cancer Genome Atlas (TCGA) and available in the NCI Genomic Data Commons (GDC) Open Access tier. We mirror and repackage a commonly used ~12k-WSI subset in LitData format for ease of training. We exclude patches that did not pass HSV thresholding, following the procedure in Kaiko.AI's Midnight paper. Patches were randomly sampled across magnification levels. The dataset contains 24,985,184 patches in total.
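To illustrate the kind of HSV thresholding mentioned above, here is a minimal sketch that keeps a patch only when enough of its pixels look like stained tissue rather than blank background. The bounds `MIN_SATURATION`, `MIN_VALUE`, and `MIN_TISSUE_FRACTION`, and all function names, are hypothetical and chosen for illustration only; they are not the values or code used by the Midnight paper or by this dataset's pipeline.

```python
import colorsys

# Hypothetical bounds for illustration only. They are NOT taken from the
# Midnight paper or from the pipeline that produced this dataset.
MIN_SATURATION = 0.07       # below this, a pixel is treated as background
MIN_VALUE = 0.20            # below this, a pixel is treated as too dark
MIN_TISSUE_FRACTION = 0.5   # share of tissue pixels required to keep a patch


def is_tissue_pixel(r, g, b):
    """Return True if an 8-bit RGB pixel falls inside the assumed HSV bounds."""
    _, s, v = colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)
    return s >= MIN_SATURATION and v >= MIN_VALUE


def tissue_fraction(pixels):
    """Return the fraction of (r, g, b) pixels classified as tissue."""
    pixels = list(pixels)
    if not pixels:
        return 0.0
    return sum(is_tissue_pixel(*p) for p in pixels) / len(pixels)


def passes_hsv_threshold(pixels, min_fraction=MIN_TISSUE_FRACTION):
    """Keep a patch when its tissue fraction reaches the assumed minimum."""
    return tissue_fraction(pixels) >= min_fraction


if __name__ == "__main__":
    # Synthetic 224 x 224 patches: pure white background and a uniform pink.
    white_patch = [(255, 255, 255)] * (224 * 224)
    pink_patch = [(200, 120, 170)] * (224 * 224)
    print(passes_hsv_threshold(white_patch))
    print(passes_hsv_threshold(pink_patch))
```

The sketch uses only the standard library so it stays self-contained. A real pipeline would more likely run a vectorized equivalent over image arrays.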
**Primary source**
- The Cancer Genome Atlas Program (TCGA), National Cancer Institute and National Human Genome Research Institute
**Use and redistribution**
- These WSIs originate from the GDC Open Access tier. Redistribution of Open Access TCGA content is permitted.
- Do not attempt to re-identify participants. Follow NIH Genomic Data Sharing policy and GDC Open Access terms.
- Cite TCGA and the GDC in publications that use this dataset.
**License note**
- We add no license of our own. This mirror follows TCGA and GDC Open Access terms.
Provider:
medarc



