Supporting data for "SurGen: 1020 H&E-stained Whole Slide Images With Survival and Genetic Markers"
DataCite Commons. Updated: 2025-06-17. Indexed: 2026-05-03.
Download link:
http://gigadb.org/dataset/102725
Description:
Cancer remains one of the leading causes of morbidity and mortality worldwide. Comprehensive datasets that combine histopathological images with genetic and survival data across various tumour sites are essential for advancing computational pathology and personalised medicine.
We present SurGen, a dataset comprising 1,020 H&E-stained whole slide images (WSIs) from 843 colorectal cancer cases. The dataset includes detailed annotations for key genetic mutations (KRAS, NRAS, BRAF) and mismatch repair status, as well as survival data for 426 cases. We illustrate SurGen's utility with a proof-of-concept model that predicts mismatch repair status directly from WSIs, achieving a test AUROC of 0.8316. These preliminary results underscore the dataset's potential to facilitate research in biomarker discovery, prognostic modelling, and advanced machine learning applications in colorectal cancer and beyond.
SurGen offers a valuable resource for the scientific community, enabling studies that require high-quality WSIs linked with comprehensive clinical and genetic information on colorectal cancer. Our initial findings affirm the dataset's capacity to advance diagnostic precision and foster the development of personalised treatment strategies in colorectal oncology.
Provider:
GigaScience Database
Created:
2025-06-17



