five

Long-read de novo assembly of the Plasmodium falciparum genome using single molecule real-time sequencing

收藏
NIAID Data Ecosystem2026-03-10 收录
下载链接:
https://www.ncbi.nlm.nih.gov/sra/ERP013212
下载链接
链接失效反馈
官方服务:
资源简介:
The application of next generation sequencing to estimate genetic diversity of Plasmodium falciparum, the most lethal malaria parasite, has proved challenging due to the skewed AT-richness (~80.6% (A+T)) of its genome and the lack of technology to assemble highly polymorphic sub-telomeric regions that contain clonally variant, multigene virulence families (Ex: var and rifin). To address this, we performed amplification-free, single molecule, real-time sequencing of P. falciparum genomic DNA and generated reads of average length 12 kb, with 50% of the reads between 15.5 and 50 kb in length. Next, using hierarchical genome assembly process, we assembled the P. falciparum genome de novo, and successfully compiled all 14 nuclear chromosomes telomere-to-telomere. We also accurately resolved centromeres (~90-99% (A+T)) and sub-telomeric regions, and identified large insertions and duplications that add extra var and rifin genes to the genome, along with smaller structural variants such as homopolymer tract expansions. Overall, we show that amplification-free, long-read sequencing combined with de novo assembly overcomes major challenges inherent to studying the P. falciparum genome. Indeed, this technology may not only identify the polymorphic and repetitive sub-telomeric sequences of parasite populations from endemic areas, but may also evaluate structural variation linked to virulence, drug resistance and disease transmission.
创建时间:
2018-02-21
二维码
社区交流群
二维码
科研交流群
商业服务