five

hCINAP expression in colorectal cancer

收藏
DataCite Commons2020-09-02 更新2024-07-25 收录
下载链接:
https://figshare.com/articles/dataset/Data_availability_for_bioinformatics_docx/4737181/3
下载链接
链接失效反馈
官方服务:
资源简介:
The authors declare that the data analysis processes supporting the findings of this study are available within the article and its Supplementary Information files. The TCGA gene expression profile data, as recomputed based on gencode v23, were downloaded from UCSC Xena (http://xena.ucsc.edu/). The TCGA clinical data were downloaded from the GDC Data Portal (https://gdc-portal.nci.nih.gov/), with accession number phs000178.v9.p8 in dbGap. Supplementary Information: For analyzing the <i>hCINAP</i> expression in CRC, we downloaded the recomputed TCGA gene expression datasets for COAD and READ cancer types from the UCSC Xena (http://xena.ucsc.edu/). The gene model was based on gencode v23, and the expression unit is TPM (Transcript per million). The clinical data were downloaded from the GDC Data Portal (https://gdc-portal.nci.nih.gov/). For differential expression analysis, we compiled a selected sample set, including 367 tumor- and 51 normal-samples, in which each sample has information available for clinical variables such as gender, age and race (Supplementary Table1). For expression analysis by pathological stages, we only used those tumor samples with stage information (Supplementary Table1). The dataset used for profiling gene expression by CRC subtypes was compiled based on the results of consensus molecular subtypes (CMSs) described previously [PMID: 26457759] , containing 265 tumor samples (Supplementary Table1).

作者声明,本研究结果所依托的数据分析流程可在本文及其补充信息文件中获取。基于gencode v23重新计算的TCGA基因表达谱数据,从UCSC Xena(http://xena.ucsc.edu/)下载获取。TCGA临床数据从GDC数据门户(https://gdc-portal.nci.nih.gov/)下载,其dbGap登录号为phs000178.v9.p8。 补充信息:为分析结直肠癌(CRC)中hCINAP的表达情况,我们从UCSC Xena下载了针对COAD与READ癌种的重新计算版TCGA基因表达数据集。该数据集的基因模型基于gencode v23,表达单位为TPM(Transcript per million,每百万转录本数)。临床数据同样从GDC数据门户(https://gdc-portal.nci.nih.gov/)下载。 在差异表达分析中,我们构建了选定的样本集,包含367份肿瘤样本与51份正常样本,所有样本均具备性别、年龄、种族等临床变量信息(补充表1)。针对病理分期的表达分析,我们仅纳入了带有完整分期信息的肿瘤样本(补充表1)。 基于结直肠癌亚型的基因表达谱分析数据集,依托此前报道的共识分子亚型(CMSs)研究结果构建[PMID: 26457759],共包含265份肿瘤样本(补充表1)。
提供机构:
figshare
创建时间:
2017-03-12
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作