DICOM converted Slide Microscopy images for the TCGA-PAAD collection
Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://zenodo.org/record/12689972
Description:
This dataset corresponds to a collection of images and/or image-derived data available from the National Cancer Institute
Imaging Data Commons (IDC) [1]. The IDC team converted this dataset
into DICOM representation and ingested it. You can explore and visualize the corresponding images in the
IDC Portal: TCGA-PAAD. You can use the manifests included in this
Zenodo record to download the content of the collection by following the Download instructions below.
Collection description
Pancreatic ductal adenocarcinoma is the most common form of pancreatic cancer, making up more than 80% of cases. The disease begins in the cells of the pancreas's ducts, which transport juices containing digestive enzymes into the small intestine.
Pancreatic cancer is the fourth most common cause of global cancer-related deaths and is almost always fatal. In 2012, it was estimated that around 44,000 new cases of pancreatic cancer were diagnosed and more than 37,000 deaths from this disease occurred in the United States alone, affecting both men and women.
Please see the TCGA-PAAD information page to learn more about the images and to obtain any supporting metadata for this collection.
Citation guidelines can be found on the Citing TCGA in Publications and Presentations information page.
Files included
A manifest file's name indicates the IDC data release in which a version of collection data was first introduced.
For example, collection_id-idc_v8-aws.s5cmd corresponds to the contents of the
collection_id collection introduced in IDC data
release v8. If there is a subsequent version of this Zenodo page, it will indicate when a subsequent version of
the corresponding collection was introduced.
tcga_paad-idc_v8-aws.s5cmd: manifest of files available for download from public IDC Amazon Web Services buckets
tcga_paad-idc_v8-gcs.s5cmd: manifest of files available for download from public IDC Google Cloud Storage buckets
tcga_paad-idc_v8-dcf.dcf: Gen3 manifest (for details see https://learn.canceridc.dev/data/organization-of-data/guids-and-uuids)
Note that manifest files that end in -aws.s5cmd reference files stored in Amazon Web Services (AWS) buckets, while those that end in -gcs.s5cmd reference
files stored in Google Cloud Storage (GCS) buckets. The actual files are identical and are mirrored between AWS and GCP.
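The naming convention described above can be unpacked programmatically, which may help when a record contains several manifests. The following is a minimal sketch, not part of the IDC tooling: the function name parse_manifest_name and the regular expression are hypothetical, and the pattern is inferred only from the three file names listed in this record.

```python
import re
from typing import NamedTuple


class ManifestName(NamedTuple):
    collection_id: str  # e.g. "tcga_paad"
    idc_release: int    # IDC data release in which this version first appeared
    target: str         # "aws", "gcs", or "dcf"
    extension: str      # "s5cmd" or "dcf"


# Assumption: the pattern is inferred from the file names in this record
# (<collection_id>-idc_v<release>-<target>.<extension>); other records may differ.
_MANIFEST_PATTERN = re.compile(
    r"^(?P<collection>.+)-idc_v(?P<release>\d+)-(?P<target>aws|gcs|dcf)"
    r"\.(?P<ext>s5cmd|dcf)$"
)


def parse_manifest_name(filename: str) -> ManifestName:
    """Split a manifest file name into its components (hypothetical helper)."""
    match = _MANIFEST_PATTERN.match(filename)
    if match is None:
        raise ValueError(f"unrecognized manifest name: {filename!r}")
    return ManifestName(
        collection_id=match["collection"],
        idc_release=int(match["release"]),
        target=match["target"],
        extension=match["ext"],
    )
```

For instance, parse_manifest_name("tcga_paad-idc_v8-gcs.s5cmd") yields the collection identifier tcga_paad, release 8, and the gcs target, so a script could pick the manifest that matches the cloud provider it runs on.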
Download instructions
Each of the manifests includes instructions in the header on how to download the included files.
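To read those header instructions without opening the file in an editor, a small helper can separate the header from the entries. This is a sketch under a stated assumption: header lines are taken to be the lines that begin with "#", which this record does not state explicitly. The name split_manifest_text is hypothetical.

```python
from typing import List, Tuple


def split_manifest_text(text: str) -> Tuple[List[str], List[str]]:
    """Split manifest text into (header_lines, entry_lines).

    Assumption: header/comment lines start with '#'. Blank lines are skipped.
    """
    header: List[str] = []
    entries: List[str] = []
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped:
            continue
        if stripped.startswith("#"):
            # Drop the leading '#' characters and surrounding whitespace.
            header.append(stripped.lstrip("#").strip())
        else:
            entries.append(stripped)
    return header, entries
```

The manifest text can be obtained with pathlib.Path("tcga_paad-idc_v8-aws.s5cmd").read_text(); printing the returned header lines shows the embedded instructions, and len(entries) gives a rough count of manifest entries.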
To download the files using .s5cmd manifests:
Install the idc-index package: pip install --upgrade idc-index
Download the files referenced by the manifests included in this dataset by passing the .s5cmd manifest file: idc download manifest.s5cmd
To download the files using the .dcf manifest, see the manifest header.
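The .s5cmd steps above can also be scripted. The sketch below simply wraps the idc download command quoted in this record using Python's standard library; the helper names build_download_command and download_manifest are hypothetical, and it assumes the idc-index package is already installed so that the idc executable is on the PATH.

```python
import shutil
import subprocess
from pathlib import Path
from typing import List, Union


def build_download_command(manifest: Union[str, Path]) -> List[str]:
    """Return the command line from the instructions: idc download <manifest>."""
    return ["idc", "download", str(manifest)]


def download_manifest(manifest: Union[str, Path]) -> None:
    """Run 'idc download' for one .s5cmd manifest (hypothetical wrapper)."""
    manifest = Path(manifest)
    if not manifest.is_file():
        raise FileNotFoundError(f"manifest not found: {manifest}")
    if shutil.which("idc") is None:
        # The idc command is provided by: pip install --upgrade idc-index
        raise RuntimeError("'idc' command not found; install idc-index first")
    # check=True raises CalledProcessError if the download command fails.
    subprocess.run(build_download_command(manifest), check=True)


# Example call (not executed here, since it starts a real download):
# download_manifest("tcga_paad-idc_v8-aws.s5cmd")
```

Checking for the manifest and the executable up front gives a clearer error than a failed subprocess call, which can be useful when the download runs unattended.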
Acknowledgments
The Imaging Data Commons team has been funded in whole or in part with Federal funds from the National Cancer Institute,
National Institutes of Health, under Task Order No. HHSN26110071 under Contract No. HHSN261201500003l.
References
[1] Fedorov, A., Longabaugh, W. J. R., Pot, D., Clunie, D. A., Pieper, S. D., Gibbs, D. L., Bridge, C., Herrmann,
M. D., Homeyer, A., Lewis, R., Aerts, H. J. W., Krishnaswamy, D., Thiriveedhi, V. K., Ciausu, C., Schacherer,
D. P., Bontempi, D., Pihl, T., Wagner, U., Farahani, K., Kim, E. & Kikinis, R.
National Cancer Institute Imaging Data Commons: Toward Transparency, Reproducibility, and Scalability in
Imaging Artificial Intelligence. RadioGraphics (2023). https://doi.org/10.1148/rg.230180
Created:
2024-08-20



