Data from: De novo sequencing and variant calling with nanopores using PoreSeq
收藏DataCite Commons2025-05-01 更新2025-05-10 收录
下载链接:
https://datadryad.org/dataset/doi:10.5061/dryad.84d4j
下载链接
链接失效反馈官方服务:
资源简介:
The accuracy of sequencing single DNA molecules with nanopores is
continually improving, but de novo genome sequencing and assembly using
only nanopore data remain challenging. Here we describe PoreSeq, an
algorithm that identifies and corrects errors in nanopore sequencing data
and improves the accuracy of de novo genome assembly with increasing
coverage depth. The approach relies on modeling the possible sources of
uncertainty that occur as DNA transits through the nanopore and finds the
sequence that best explains multiple reads of the same region. PoreSeq
increases nanopore sequencing read accuracy of M13 bacteriophage DNA from
85% to 99% at 100× coverage. We also use the algorithm to assemble
Escherichia coli with 30× coverage and the λ genome at a range of
coverages from 3× to 50×. Additionally, we classify sequence variants at
an order of magnitude lower coverage than is possible with existing
methods.
提供机构:
Dryad
创建时间:
2015-09-03



