结核病中3个独立汉族人群全基因组关联研究分析
收藏国家基础学科公共科学数据中心2024-03-05 收录
下载链接:
https://www.nbsdc.cn/general/dataDetail?id=64ef83d4bb16e0591d024c93&type=1
下载链接
链接失效反馈官方服务:
资源简介:
结核病中3个独立汉族人群全基因组关联研究分析主要面向调控病原菌感染致病的新型宿主免疫蛋白分子机制研究,为了确定结核病的易感位点,我们在中国汉族人群中进行了三阶段的结核病GWAS。我们使用Affymetrix Axiom CHB阵列对2112名受试者进行了全基因组基因分型(发现阶段)。在进行质量控制(QC)和插补后,分析了833例肺结核病例和1220例对照中总计5374021个次要等位基因频率(MAF)为3%或更高的变异。我们利用主成分分析(PCA)检测样本中的潜在人群分层,并在发现阶段对前三个主成分进行逻辑回归分析。PCA调整后,我们没有观察到测试统计数据的显著膨胀(膨胀因子λ = 1.01),表明潜在人口分层的影响得到了很好的控制。本数据包含以上样本的GWAS摘要统计数据,数据量251.18MB。
This three-stage genome-wide association study (GWAS) of tuberculosis involving three independent Han Chinese populations primarily aims to investigate the molecular mechanisms of novel host immune proteins that regulate pathogen infection and pathogenicity. To identify susceptibility loci for tuberculosis, we conducted this three-stage GWAS among the Han Chinese population. In the discovery stage, genome-wide genotyping was performed for 2112 subjects using the Affymetrix Axiom CHB array. Following quality control (QC) and imputation, a total of 5,374,021 variants with a minor allele frequency (MAF) ≥ 3% were analyzed across 833 pulmonary tuberculosis cases and 1220 controls. We utilized principal component analysis (PCA) to detect potential population stratification in the samples, and implemented logistic regression analysis on the top three principal components in the discovery stage. After PCA adjustment, no significant inflation of test statistics was observed (inflation factor λ = 1.01), indicating that the confounding effect of potential population stratification was well controlled. This dataset contains the GWAS summary statistics of the aforementioned samples, with a total size of 251.18 MB.
提供机构:
同济大学
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集聚焦于结核病易感位点的研究,通过对中国汉族人群进行三阶段全基因组关联分析(GWAS),提供了2112名受试者的基因分型数据和GWAS摘要统计数据。数据量为251.18MB,包含2个文件,适用于病原菌感染与宿主免疫机制的研究。
以上内容由遇见数据集搜集并总结生成



