five

Linkage-Disequilibrium (LD) matrices for six continental ancestry groups from the UK Biobank

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14614206
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains Linkage Disequilibrium (LD) matrices for six ancestry groups from the UK Biobank. LD matrices record the SNP-by-SNP correlations in a given sample of individuals from the general population. In this case, we threshold the matrices so that we only record the correlations between variants in the same LD block (defined by LDetect). The continental ancestry groups are defined by the Pan-UKB initiative as: EUR = European ancestry (N=362446) CSA = Central/South Asian ancestry (N=8284) AFR = African ancestry (N=6255) EAS = East Asian ancestry (N=2700) MID = Middle Eastern ancestry (N=1567) AMR = Admixed American ancestry (N=987) The sample sizes here are restricted to unrelated individuals in the UK Biobank. The matrices were computed using  magenpy and quantized to int8 data type for better compressibility. The standard matrices (EUR.tar.gz, AFR.tar.gz, ...) contain pairwise correlations for 1.4 million HapMap3+ variants. For European samples, we also provide LD matrices that record pairwise correlations for up to 18 million variants (EUR_18m_variants.tar.gz) For more details on how these matrices were computed, please consult our manuscript: Towards whole-genome inference of polygenic scores with fast and memory-efficient algorithmsShadi Zabad, Chirayu Anant Haryan, Simon Gravel, Sanchit Misra, Yue Li To access these matrices, consult the codebase of magenpy, our custom python package with special data structures for processing these LD matrices.
创建时间:
2025-01-08
二维码
社区交流群
二维码
科研交流群
商业服务