
Supporting data for "Atria: An Ultra-fast and Accurate Trimmer for Adapter and Quality Trimming"

Source: DataCite Commons | Updated: 2025-05-26 | Indexed: 2024-07-13
Download link:
http://gigadb.org/dataset/100935
Description:
As next-generation sequencing takes a dominant role in terms of output capacity and sequence length, adapters attached to the reads and low-quality bases hinder the performance of downstream analysis directly and implicitly, for example by producing false-positive single nucleotide polymorphisms (SNPs) and fragmented assemblies. A fast trimming algorithm is needed to remove adapters precisely, especially in read tails of relatively low quality.

We present a trimming program named Atria. Atria matches the adapters in paired reads and finds possible overlapped regions with a fast, carefully designed byte-based matching algorithm (O(n) time with O(1) space). Atria also implements multi-threading in both sequence processing and file compression, and supports single-end reads.

Atria performs favorably against other cutting-edge trimmers in various trimming and runtime benchmarks on both simulated and real data. We also provide an ultra-fast and lightweight byte-based matching algorithm, which can be used in a broad range of short-sequence matching applications, such as primer search and seed scanning before alignment.

The Atria executables, source code, and benchmark scripts are available at https://github.com/cihga39871/Atria under the MIT license.
Provider:
GigaScience Database
Created:
2021-09-29