five

Decomposed matrices used for the analysis described in 'Components of genetic associations across 2,138 phenotypes in the UK Biobank highlight adipocyte biology'

收藏
nih.figshare.com2023-05-30 更新2025-03-25 收录
下载链接:
https://nih.figshare.com/articles/dataset/Decomposed_matrices_used_for_the_analysis_described_in_Components_of_genetic_associations_across_2_138_phenotypes_in_the_UK_Biobank_highlight_adipocyte_biology_/9202247/1
下载链接
链接失效反馈
官方服务:
资源简介:
The dataset deposited here contains decomposed matrices of GWAS summary statistics across 2,138 phenotypes described in the following publication:Y. Tanigawa*, J. Li*, et al., Components of genetic associations across 2,138 phenotypes in the UK Biobankhighlight adipocyte biology. Nature Communications (2019). doi:10.1038/s41467-019-11953-9.The data are provided as three Python Numpy data (npz) files, each of which corresponds to the three datasets used in computational analysis described in our manuscript.- "all" dataset: dev_allNonMHC_z_center_p0001_100PCs_20180129.npz- "Coding only" dataset: dev_codingNonMHC_z_center_p0001_100PCs_20180129.npz- "PTVs only" dataset: dev_PTVsNonMHC_z_center_p0001_100PCs_20180129.npzThose files can be loaded with Python numpy package and were used in our analysis scripts and notebook (https://github.com/rivas-lab/public-resources/tree/master/uk_biobank/DeGAs).Please read our publication for more information regarding this dataset.AbstractPopulation-based biobanks with genomic and dense phenotype data provide opportunities for generating effective therapeutic hypotheses and understanding the genomic role in disease predisposition. To characterize latent components of genetic associations, we applied truncated singular value decomposition (DeGAs) to matrices of summary statistics derived from genome-wide association analyses across 2,138 phenotypes measured in 337,199 White British individuals in the UK Biobank study. We systematically identified key components of genetic associations and the contributions of variants, genes, and phenotypes to each component. As an illustration of the utility of the approach to inform downstream experiments, we report putative loss of function variants, rs114285050 (GPR151) and rs150090666 (PDE3B), that substantially contribute to obesity-related traits, and experimentally demonstrate the role of these genes in adipocyte biology. Our approach to dissect components of genetic associations across the human phenome will accelerate biomedical hypothesis generation by providing insights on previously unexplored latent structures.

本处存档的数据集包含了对2,138个表型中GWAS总结统计量的分解矩阵,这些表型在以下出版物中被描述:Y. Tanigawa*,J. Li*等,《英国生物样本库中2,138个表型遗传关联的成分:突出脂肪细胞生物学》(Nature Communications,2019)。DOI:10.1038/s41467-019-11953-9。数据以三个Python Numpy数据(npz)文件提供,每个文件对应于我们论文中描述的计算分析所使用的三个数据集。- "all" 数据集:dev_allNonMHC_z_center_p0001_100PCs_20180129.npz- "Coding only
提供机构:
nih.figshare.com
二维码
社区交流群
二维码
科研交流群
商业服务