five

Gene expression matrix and clinical metadata for COPD, NSCLC, and NSCLC with COPD

收藏
DataCite Commons2025-09-05 更新2026-05-05 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=c328cea70e0b4486b05589e58cf49f24
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset provides gene expression matrices and comprehensive clinical metadata for three patient groups: (i) COPD alone, (ii) NSCLC alone, and (iii) NSCLC with comorbid COPD. Raw RNA-seq profiles were downloaded from NCBI-GEO (accession numbers GSEXXXXX, GSEYYYYY, …), uniformly processed (batch-effect removal, duplicate removal, outlier filtering) and quantile-normalized to TPM followed by log2 transformation. The matrix contains 19 848 protein-coding genes (including 1 133 mitochondria-related genes) as rows and samples as columns. Clinical variables comprise 26 fields such as age, sex, smoking pack-years, FEV1 %, tumor stage, histology, etc. The clean, ready-to-use data enable differential expression, WGCNA, machine-learning classification/regression, and mitochondria-focused functional analyses aimed at identifying shared or disease-specific mitochondrial gene signatures underlying COPD–NSCLC comorbidity.
提供机构:
Science Data Bank
创建时间:
2025-09-05
二维码
社区交流群
二维码
科研交流群
商业服务