
Preprocessed TCGA Breast Invasive Carcinoma Multi-Omics Dataset with Survival Annotations

Source: DataONE. Updated: 2025-12-01. Indexed: 2025-12-20.
Download link:
https://search.dataone.org/view/sha256:3a6224466f9b1bb2b5c9fc0ab31ac6cd706f90f2e0891af838ade0faa4af3b73
Description:
Preprocessed multi-omics dataset from TCGA Breast Invasive Carcinoma (BRCA), comprising RNA-seq gene expression, DNA methylation, and copy number variation data for 710 patients across 16,163 genes. The dataset underwent comprehensive preprocessing and quality control, including ComBat batch correction (55% reduction in technical variance), quantile normalization and log-transformation for expression data, β-value to M-value transformation for methylation data, and KNN-based imputation for missing values. All three omics layers are gene-aligned and biologically validated through expected cross-omics correlations. The dataset is fully analysis-ready and suitable for downstream machine learning tasks such as survival prediction, molecular subtyping, and integrative multi-omics studies.
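Three of the transformations named above can be illustrated with a minimal NumPy sketch: the β-value to M-value conversion, M = log2(β / (1 − β)), a log-transform of expression values, and quantile normalization across samples. This is not the dataset authors' pipeline. The function names, the clipping constant `eps`, the `pseudocount`, and the order of operations are all assumptions chosen for illustration, and ComBat correction and KNN imputation are not shown.

```python
import numpy as np


def beta_to_m(beta, eps=1e-6):
    """Convert methylation beta-values in [0, 1] to M-values.

    M = log2(beta / (1 - beta)). Values are clipped to [eps, 1 - eps]
    (eps is an assumed constant) so that 0 and 1 do not give infinities.
    """
    b = np.clip(np.asarray(beta, dtype=float), eps, 1.0 - eps)
    return np.log2(b / (1.0 - b))


def log_transform(expr, pseudocount=1.0):
    """log2(x + pseudocount) for non-negative expression values.

    The pseudocount of 1 is an assumption; the listing does not state one.
    """
    return np.log2(np.asarray(expr, dtype=float) + pseudocount)


def quantile_normalize(matrix):
    """Quantile-normalize the columns of a genes x samples matrix.

    Each column is mapped onto the mean of the sorted columns, so all
    samples end up with the same distribution. Ties are broken by
    position rather than averaged, which keeps the sketch short.
    """
    x = np.asarray(matrix, dtype=float)
    order = np.argsort(x, axis=0, kind="stable")
    ranks = np.argsort(order, axis=0, kind="stable")  # rank of each entry in its column
    reference = np.sort(x, axis=0).mean(axis=1)       # mean value at each rank
    return reference[ranks]


if __name__ == "__main__":
    # Toy matrices (genes x samples); the values are made up for illustration.
    toy_expression = np.array([[5.0, 4.0], [2.0, 1.0], [3.0, 6.0]])
    toy_beta = np.array([[0.10, 0.50], [0.80, 0.95]])
    print(quantile_normalize(log_transform(toy_expression)))
    print(beta_to_m(toy_beta))
```

M-values are commonly preferred over β-values for statistical modeling because they are unbounded and have more uniform variance across the methylation range, which may be why the transformation appears in the preprocessing described here.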
Created:
2025-12-04