Supporting data for "Accurate gene consensus at low nanopore coverage"

DataCite Commons. Updated 2025-05-26; indexed 2025-04-15
Download link:
http://gigadb.org/dataset/102265
Description:
Nanopore technologies allow high-throughput sequencing of long strands of DNA at the cost of a relatively high error rate. This limits their use for reading amplicon libraries, in which each variant carries only a few mutations that are easily confused with sequencing noise. Consensus calling strategies reduce the error but sacrifice part of the throughput, because each member of the library is typically read 30 to 100 times.

In this work, we develop SINGLe (SNPs In Nanopore reads of Gene Libraries), an error correction method to reduce the noise in nanopore reads of amplicons containing point variations. SINGLe exploits the fact that, in an amplicon library, all reads are very similar to a wild-type sequence, from which the position-specific systematic sequencing error pattern can be characterized experimentally. It then uses this information to reweight the confidence given to nucleotides that do not match the wild type in individual variant reads, and incorporates it into the consensus calculation.

We tested SINGLe on a mutagenic library of the KlenTaq polymerase gene, where the true mutation rate was below the sequencing noise. We observed that, contrary to other methods, SINGLe compensates for the systematic errors made by the basecallers. Consequently, SINGLe converges to the true sequence using as few as 5 reads per variant, fewer than other available methods.
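
The reweighting idea described above can be illustrated with a toy sketch. This is not the SINGLe implementation or its API. It assumes reads are already aligned to the wild type and have equal length, and that a hypothetical table `mismatch_confidence` gives a weight between 0 and 1 for each non-wild-type call at each position (low where the sequencer makes that error systematically). The function name, the table, and the example values are all invented for illustration.

```python
from collections import defaultdict


def weighted_consensus(reads, wild_type, mismatch_confidence):
    """Toy weighted-vote consensus (illustrative only, not SINGLe itself).

    reads: aligned reads, each the same length as wild_type.
    mismatch_confidence: hypothetical dict mapping (position, base) to a
        weight in [0, 1] for calls that differ from the wild type.
        Missing entries default to full weight (1.0).
    """
    consensus = []
    for pos, wt_base in enumerate(wild_type):
        scores = defaultdict(float)
        for read in reads:
            base = read[pos]
            if base == wt_base:
                # Calls that agree with the wild type keep full weight.
                scores[base] += 1.0
            else:
                # Down-weight mismatches known to be systematic errors.
                scores[base] += mismatch_confidence.get((pos, base), 1.0)
        # Pick the base with the highest accumulated weight.
        consensus.append(max(scores, key=scores.get))
    return "".join(consensus)


# Made-up example: five reads of one variant of the wild type "ACGT".
wild_type = "ACGT"
reads = ["ATGA", "ATGT", "ATGA", "ACGA", "ATGT"]

# Assumed error pattern: "A" at position 3 is a frequent systematic error.
confidence = {(3, "A"): 0.2}

plain = weighted_consensus(reads, wild_type, {})
corrected = weighted_consensus(reads, wild_type, confidence)
```

In this example the unweighted vote keeps the spurious "A" at the last position, while the weighted vote discards it and still retains the C-to-T change at position 1, which is supported by four of the five reads.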
Provider:
GigaScience Database
Created:
2022-09-29