Preprocessed TCGA Breast Invasive Carcinoma Multi-Omics Dataset with Survival Annotations
Source: DataONE. Updated: 2025-12-01. Indexed: 2025-12-20.
Download link:
https://search.dataone.org/view/sha256:3a6224466f9b1bb2b5c9fc0ab31ac6cd706f90f2e0891af838ade0faa4af3b73
Description:
Preprocessed multi-omics dataset from TCGA Breast Invasive Carcinoma (BRCA), comprising RNA-seq gene expression, DNA methylation, and copy number variation data for 710 patients across 16,163 genes. The dataset underwent comprehensive preprocessing and quality control, including ComBat batch correction (55% reduction in technical variance), quantile normalization and log-transformation for expression data, β-value to M-value transformation for methylation data, and KNN-based imputation for missing values. All three omics layers are gene-aligned and biologically validated through expected cross-omics correlations. The dataset is fully analysis-ready and suitable for downstream machine learning tasks such as survival prediction, molecular subtyping, and integrative multi-omics studies.
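The description names a β-value to M-value transformation and a log-transformation but does not give the formulas used. As a minimal sketch, assuming the conventional logit-base-2 definition M = log2(β / (1 − β)) and a log2(x + 1) expression transform, the two per-value steps might look like the following. The function names, the clipping bound `eps`, and the `pseudocount` are illustrative assumptions, not parameters documented for this dataset.

```python
import math


def beta_to_m(beta, eps=1e-6):
    """Convert a methylation beta-value in [0, 1] to an M-value.

    Uses M = log2(beta / (1 - beta)). `eps` is an assumed clipping bound
    that keeps beta away from 0 and 1 so the logarithm stays finite.
    """
    b = min(max(beta, eps), 1.0 - eps)
    return math.log2(b / (1.0 - b))


def m_to_beta(m):
    """Inverse transform: beta = 2^M / (2^M + 1)."""
    return 2.0 ** m / (2.0 ** m + 1.0)


def log2_expression(value, pseudocount=1.0):
    """Log-transform an expression value; the pseudocount is an assumption."""
    return math.log2(value + pseudocount)


if __name__ == "__main__":
    for beta in (0.1, 0.5, 0.9):
        m = beta_to_m(beta)
        print(f"beta={beta:.2f} -> M={m:.3f} -> beta={m_to_beta(m):.2f}")
```

M-values are unbounded and roughly symmetric around zero, which is why they are often preferred over β-values as input to statistical models. β-values remain easier to interpret as a methylation fraction.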
Created:
2025-12-04



