five

Preprocessed TCGA Breast Invasive Carcinoma Multi-Omics Dataset with Survival Annotations

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://doi.org/10.7910/DVN/G2XQPI
下载链接
链接失效反馈
官方服务:
资源简介:
Preprocessed multi-omics dataset from TCGA Breast Invasive Carcinoma (BRCA), comprising RNA-seq gene expression, DNA methylation, and copy number variation data for 710 patients across 16,163 genes. The dataset underwent comprehensive preprocessing and quality control, including ComBat batch correction (55% reduction in technical variance), quantile normalization and log-transformation for expression data, β-value to M-value transformation for methylation data, and KNN-based imputation for missing values. All three omics layers are gene-aligned and biologically validated through expected cross-omics correlations. The dataset is fully analysis-ready and suitable for downstream machine learning tasks such as survival prediction, molecular subtyping, and integrative multi-omics studies.
创建时间:
2025-12-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作