Arab Pangenome Reference
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/13752608
下载链接
链接失效反馈官方服务:
资源简介:
Pangenomes represent a significant shift from relying on a single reference sequence to a robust set of assemblies, Arab populations remain significantly underrepresented; hence, we present the first Arab Pangenome Reference (APR) utilizing 53 individuals of diverse Arab ethnicities. We assembled nuclear and mitochondrial pangenomes using 35.27X high-fidelity long reads, 54.22X ultralong reads and 65.46X Hi-C reads yielded contiguous haplotype-phased de novo assemblies of exceptional quality, with an average N50 of 124.28 Mb. We discovered 111.96 million base pairs of novel euchromatic sequences absent from existing human pangenomes, the T2T-CHM13, GRCh38 reference human genomes, and other public datasets. We identified 8.94 million population-specific small variants and 235,195 structural variants within the Arab pangenome. We detected 883 gene duplications including 15.06% associated with recessive diseases and 1,436 bp of novel mitochondrial pangenome sequence. Our study provides a valuable resource for future genomic medicine initiatives in Arab population and other global populations.
创建时间:
2024-09-12



