
andreped/IBDColEpi

Source: Hugging Face. Updated 2023-11-08. Indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/andreped/IBDColEpi
Resource description:
---
license: mit
task_categories:
- image-segmentation
language:
- en
tags:
- medical
pretty_name: IBDColEpi
size_categories:
- 1B<n<10B
---

# IBDColEpi: 140 HE and 111 CD3-stained colon biopsies of active and inactivate inflammatory bowel disease with epithelium annotated

To access and work with the data in Python, use the Hugging Face `datasets` Python API. See this Jupyter Notebook on how to get started:
https://github.com/andreped/NoCodeSeg/blob/main/notebooks/IBDColEpi-load-dataset-example.ipynb

The data can also be downloaded through the web interface at Hugging Face, from [this Google Drive folder](https://drive.google.com/drive/u/0/folders/1eUVs1DA1UYayUYjr8_aY3O5xDgV1uLvH), or from [this DataverseNO record](https://dataverse.no/dataset.xhtml?persistentId=doi:10.18710/TLA01U).

--------------------
GENERAL INFORMATION
--------------------

1. Title of Dataset: 140 HE and 111 CD3-stained colon biopsies of active and inactivate inflammatory bowel disease with epithelium annotated: the IBDColEpi dataset

2. DOI: https://doi.org/10.18710/TLA01U

3. Contact Information
     Name: André Pedersen
     Institution: NTNU Norwegian University of Science and Technology
     Email: andre.pedersen@ntnu.no
     ORCID: https://orcid.org/0000-0002-3637-953X

4. Contributors: See metadata field Contributor.

5. Kind of data: See metadata field Kind of Data.

6. Date of data collection/generation: See metadata field Date of Collection.

7. Geographic location: See metadata section Geographic Coverage.

8. Funding sources: See metadata section Grant Information.

9. Description of dataset:

General description and ethics approvals: The dataset contains 140 HE and 111 CD3 stained, formalin fixed paraffin embedded (FFPE) biopsies of colonic mucosa. The biopsies were extracted from the NTNU/St. Olavs hospital, Trondheim University Hospital (Norway) biobank of patients with confirmed inflammatory bowel disease or healthy controls with gastrointestinal symptoms but no macroscopic or microscopic disease. Inclusion and colonoscopies were performed at the Department of Gastroenterology and Hepatology at St. Olavs hospital, Trondheim University Hospital from 2007 to 2018. All patients gave written informed consent, and ethical approvals were obtained from the Central Norway Regional Committee for Medical and Health Research Ethics (reference number 2013/212/REKMidt). Consent to publish the anonymized whole slide image (WSI) dataset was given by REKMidt in 2021.

Each database ID number used in this study was replaced with a new anonymized ID that contains only the disease status ("active" or "inactive") and whether the WSI has haematoxylin-eosin (HE) staining or CD3 immunostaining. The biopsies in the biobank were sampled such that, for each patient, one biopsy from an unaffected/inactive area and one from an affected/active area were included, each with a separate ID number. Hence, two biopsies with different ID numbers can be from the same patient.

"Active" is defined as the presence of intraepithelial granulocytes in one or more locations in the biopsies. Still, the changes may be focal, hence the majority of the epithelium may still lack intraepithelial granulocytes or other signs of active disease (crypt abscesses, granulation tissue, etc.).

---------------------------
SHARING/ACCESS INFORMATION
---------------------------

(See metadata record for dataset.)

1. Licenses/Restrictions: See Terms section.

2. Links to publications that cite or use the data: See metadata field Related Publication.

3. Links/relationships to related data sets: See metadata field Related Datasets.

4. Data sources: See metadata field Data Sources.

5. Recommended citation: See citation generated by repository.

---------------------
DATA & FILE OVERVIEW
---------------------

1. File List:
     00_README.txt
     trained-models.zip
     patch-dataset-CD3.zip
     patch-dataset-HE.zip
     qupath-project-annotations.zip
     TIFF-annotations.zip
     WSI_part_01.zip
     WSI_part_02.zip
     WSI_part_03.zip
     WSI_part_04.zip
     WSI_part_05.zip
     WSI_part_06.zip
     WSI_part_07.zip
     WSI_part_08.zip
     WSI_part_09.zip
     WSI_part_10.zip

2. Relationship between files, if important:
     - trained-models.zip: the best-performing models (for both HE and CD3), trained on the images from WSI_part_*.zip using the manual delineations from TIFF-annotations.zip.
     - WSI_part_*.zip: the colon biopsies described in the metadata (parts 1-10). For each ID, the active/inactive label Y is stored in the filename, with the format: "ID-X_Y.ndpi".
     - TIFF-annotations.zip: the annotations corresponding to the WSIs. The annotation filenames follow the same structure as the corresponding WSIs, with the format: "ID-X_Y.tiff".
     - patch-dataset-*.zip: the corresponding patch images and labels, split into train/validation/test sets, used to evaluate the design in the publication. Provided for both HE and CD3.
     - qupath-project-annotations.zip: the QuPath project file, which also contains the annotations of all WSIs and can be opened directly in QuPath (after renaming the WSI paths).
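The naming convention described above ("ID-X_Y.ndpi" for slides, "ID-X_Y.tiff" for annotations) suggests that a slide and its annotation can be matched by their shared filename stem. The following is a minimal sketch of that idea, not the authors' tooling. It assumes that X contains no underscore and that Y is everything after the first underscore; the function names `parse_name` and `pair_wsis_with_annotations` and the example filenames are hypothetical, so check them against the extracted archives before relying on this.

```python
import re
from pathlib import PurePath

# Assumption: names look like "ID-X_Y.<ext>", where X has no underscore
# and Y (the active/inactive label part) is everything after the first "_".
_NAME_RE = re.compile(r"^ID-(?P<x>[^_]+)_(?P<y>.+)$")


def parse_name(filename):
    """Split 'ID-X_Y.ext' into (X, Y, ext); raise ValueError if it does not match."""
    path = PurePath(filename)
    match = _NAME_RE.match(path.stem)
    if match is None:
        raise ValueError(f"unexpected filename: {filename!r}")
    return match["x"], match["y"], path.suffix.lstrip(".").lower()


def pair_wsis_with_annotations(wsi_names, annotation_names):
    """Return {stem: (wsi_name, annotation_name)} for stems present in both lists."""
    # Index the .tiff annotations by stem, e.g. "ID-1_active".
    annotations = {
        PurePath(name).stem: name
        for name in annotation_names
        if PurePath(name).suffix.lower() == ".tiff"
    }
    pairs = {}
    for name in wsi_names:
        path = PurePath(name)
        if path.suffix.lower() != ".ndpi":
            continue  # skip anything that is not a slide file
        parse_name(name)  # fail loudly if the assumed pattern is violated
        if path.stem in annotations:
            pairs[path.stem] = (name, annotations[path.stem])
    return pairs
```

In use, the two lists would come from listing the extracted WSI and annotation folders. Slides without a matching annotation are simply left out of the result, which makes missing files easy to spot by comparing lengths.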
Provider:
andreped
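Summary of original information

Dataset overview

Basic information

  • Dataset name: IBDColEpi
  • Dataset description: 140 HE-stained and 111 CD3-stained colon biopsies from patients with active and inactive inflammatory bowel disease, with the epithelium annotated.
  • Dataset size: 1B<n<10B
  • Language: English
  • License: MIT
  • Task category: image segmentation
  • Tags: medical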

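Dataset contents

  • Sample source: extracted from the biobank of NTNU/St. Olavs hospital, Trondheim University Hospital (Norway); includes patients with confirmed inflammatory bowel disease and healthy controls with no macroscopic or microscopic disease.
  • Collection period: 2007 to 2018
  • Ethics approval: granted by the Central Norway Regional Committee for Medical and Health Research Ethics (reference number 2013/212/REKMidt)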

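Data files

  • File list:

    • 00_README.txt
    • trained-models.zip
    • patch-dataset-CD3.zip
    • patch-dataset-HE.zip
    • qupath-project-annotations.zip
    • TIFF-annotations.zip
    • WSI_part_01.zip to WSI_part_10.zip
  • File relationships:

    • trained-models.zip: the best-performing models, trained on the images in WSI_part_*.zip using the manual delineations in TIFF-annotations.zip.
    • WSI_part_*.zip: the colon biopsies described in the metadata. For each ID, the active/inactive label Y is stored in the filename.
    • TIFF-annotations.zip: the annotations corresponding to the WSIs. Annotation filenames follow the same structure as the corresponding WSIs.
    • patch-dataset-*.zip: the corresponding patch images and labels, split into train/validation/test sets, used to evaluate the design in the publication.
    • qupath-project-annotations.zip: the QuPath project file containing the annotations of all WSIs.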