five

Arab Pangenome Reference

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/13752608
下载链接
链接失效反馈
官方服务:
资源简介:
Pangenomes represent a significant shift from relying on a single reference sequence to a robust set of assemblies, Arab populations remain significantly underrepresented; hence, we present the first Arab Pangenome Reference (APR) utilizing 53 individuals of diverse Arab ethnicities. We assembled nuclear and mitochondrial pangenomes using 35.27X high-fidelity long reads, 54.22X ultralong reads and 65.46X Hi-C reads yielded contiguous haplotype-phased de novo assemblies of exceptional quality, with an average N50 of 124.28 Mb. We discovered 111.96 million base pairs of novel euchromatic sequences absent from existing human pangenomes, the T2T-CHM13, GRCh38 reference human genomes, and other public datasets. We identified 8.94 million population-specific small variants and 235,195 structural variants within the Arab pangenome. We detected 883 gene duplications including 15.06% associated with recessive diseases and 1,436 bp of novel mitochondrial pangenome sequence. Our study provides a valuable resource for future genomic medicine initiatives in Arab population and other global populations.
创建时间:
2024-09-12
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作