five

Table2_Genome-wide imputed differential expression enrichment analysis identifies trait-relevant tissues.XLSX

收藏
NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://figshare.com/articles/dataset/Table2_Genome-wide_imputed_differential_expression_enrichment_analysis_identifies_trait-relevant_tissues_XLSX/21842742
下载链接
链接失效反馈
官方服务:
资源简介:
The identification of pathogenically-relevant genes and tissues for complex traits can be a difficult task. We developed an approach named genome-wide imputed differential expression enrichment (GIDEE), to prioritise trait-relevant tissues by combining genome-wide association study (GWAS) summary statistic data with tissue-specific expression quantitative trait loci (eQTL) data from 49 GTEx tissues. Our GIDEE approach analyses robustly imputed gene expression and tests for enrichment of differentially expressed genes in each tissue. Two tests (mean squared z-score and empirical Brown’s method) utilise the full distribution of differential expression p-values across all genes, while two binomial tests assess the proportion of genes with tissue-wide significant differential expression. GIDEE was applied to nine training datasets with known trait-relevant tissues and ranked 49 GTEx tissues using the individual and combined enrichment tests. The best-performing enrichment test produced an average rank of 1.55 out of 49 for the known trait-relevant tissue across the nine training datasets—ranking the correct tissue first five times, second three times, and third once. Subsequent application of the GIDEE approach to 20 test datasets—whose pathogenic tissues or cell types are uncertain or unknown—provided important prioritisation of tissues relevant to the trait’s regulatory architecture. GIDEE prioritisation may thus help identify both pathogenic tissues and suitable proxy tissue/cell models (e.g., using enriched tissues/cells that are more easily accessible). The application of our GIDEE approach to GWAS datasets will facilitate follow-up in silico and in vitro research to determine the functional consequence(s) of their risk loci.

识别复杂性状的致病相关基因与组织往往是一项极具挑战性的任务。我们开发了一种名为全基因组估算差异表达富集(genome-wide imputed differential expression enrichment, GIDEE)的方法,通过整合全基因组关联研究(Genome-Wide Association Study, GWAS)汇总统计数据与来自49个GTEx组织的组织特异性表达数量性状基因座(expression quantitative trait loci, eQTL)数据,对性状相关组织进行优先级排序。我们的GIDEE方法可对经稳健估算得到的基因表达进行分析,并检验各组织中差异表达基因的富集情况。其中两种检验方法(均方z得分法与经验布朗法)利用了所有基因的差异表达p值的完整分布,而另外两种二项式检验则针对具有组织水平显著差异表达的基因比例进行评估。我们将GIDEE应用于9个带有已知性状相关组织的训练数据集,并通过单独及联合富集检验对49个GTEx组织进行排序。在这9个训练数据集中,表现最佳的富集检验将已知性状相关组织的平均排名为1.55/49——其中5次将正确组织排在首位,3次排在第二位,1次排在第三位。随后我们将GIDEE方法应用于20个测试数据集——这些数据集的致病组织或细胞类型尚不明确——为与性状调控架构相关的组织提供了重要的优先级排序结果。因此,GIDEE的优先级排序结果可助力识别致病组织与合适的替代组织/细胞模型(例如使用更易于获取的富集组织/细胞)。将我们的GIDEE方法应用于GWAS数据集,将有助于推动后续的计算机模拟与体外实验研究,以明确其风险位点的功能效应。
创建时间:
2023-01-09
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作