five

De novo human genome assemblies across populations

收藏
NIAID Data Ecosystem2026-04-25 收录
下载链接:
https://www.ncbi.nlm.nih.gov/sra/SRP133412
下载链接
链接失效反馈
官方服务:
资源简介:
The human genome reference is an integral part of modern biological research. However, a single consensus genome representation is inadequate to provide a universal reference structure due to its lack of sequence variation and structural diversity. Using 10x Genomics “linked-read” technology, we performed whole genome sequencing and de novo assembly on 18 individuals across five different populations. Here, we identified 1,872 breakpoint-resolved Novel Sequence Insertions (NSIs), and accumulatively they added up to 2.1Mb of new genomic content not currently represented in the human genome reference. These NSIs adhere to an underlying population structure, as principal component analysis can faithfully stratify these individuals based on their genetic backgrounds. Furthermore, over half of these NSIs aligned to nonhuman primates, suggesting that they are ancestral to humans. While investigating their origins, we discovered that a subset of NSIs were results of Alu-mediated recombination. Additionally, some of the NSIs aligned to either entries in the expressed sequence tag database or reads from RNA-sequencing experiments, suggesting that they bear sequences that are transcribed. Our results demonstrate that as the genomic variant catalogs continue to expand, a more comprehensive set of alternative haplotypes are necessary to depict the complete spectrum of genetic diversity across populations.
创建时间:
2020-08-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作