five

Supporting data for "SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees"

收藏
DataCite Commons2025-05-26 更新2025-04-15 收录
下载链接:
http://gigadb.org/dataset/100709
下载链接
链接失效反馈
官方服务:
资源简介:
Alignment of sequence reads generated by next-generation sequencing (NGS) is an integral part of most pipelines analyzing NGS data. A number of tools designed to quickly align large volume of sequences are already available. However, most existing tools lack explicit guarantees about their output. They also do not support searching genome assemblies, such as the human genome assembly GRCh38, that include primary and alternate sequences and placement information for alternate sequences to primary sequences in the assembly. <br>This paper describes SRPRISM (Single Read Paired Read Indel Substitution Minimizer), an alignment tool for aligning reads without splices. SRPRISM has features not available in most tools, such as (i) support for searching genome assemblies with alternate sequences, (ii) partial alignment of reads with a specified region of reads to be included in the alignment, (iii) choice of ranking schemes for alignments, and (iv) explicit criteria for search sensitivity. We compare the performance of SRPRISM to GEM, Kart, STAR, BWA-MEM, Bowtie2, Hobbes, and Yara using benchmark sets for paired and single reads of lengths 100 bp and 250 bp generated using DWGSIM. We show that SRPRISM finds the best results for most benchmark sets with error rate of up to about 2.5% and GEM performs best for higher error rates. SRPRISM is also more sensitive than other tools even when sensitivity is reduced to improve run time performance. <br>We present SRPRISM as a flexible read mapping tool that provides explicit guarantees on results.
提供机构:
GigaScience Database
创建时间:
2020-02-07
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作