five

Data from: AftrRAD: a pipeline for accurate and efficient de novo assembly of RADseq data

收藏
DataONE2015-01-29 更新2024-06-27 收录
下载链接:
https://search.dataone.org/view/null
下载链接
链接失效反馈
官方服务:
资源简介:
An increase in studies using restriction site-associated DNA sequencing (RADseq) methods has led to a need for both the development and assessment of novel bioinformatic tools that aid in the generation and analysis of these data. Here, we report the availability of AftrRAD, a bioinformatic pipeline that efficiently assembles and genotypes RADseq data, and outputs these data in various formats for downstream analyses. We use simulated and experimental data sets to evaluate AftrRAD's ability to perform accurate de novo assembly of loci, and we compare its performance with two other commonly used programs, stacks and pyrad. We demonstrate that AftrRAD is able to accurately assemble loci, while accounting for indel variation among alleles, in a more computationally efficient manner than currently available programs. AftrRAD run times are not strongly affected by the number of samples in the data set, making this program a useful tool when multicore systems are not available for parallel processing, or when data sets include large numbers of samples.

随着限制性酶切位点相关DNA测序(restriction site-associated DNA sequencing, RADseq)技术在研究中的应用日益广泛,开发并评估可辅助此类数据生成与分析的新型生物信息学工具已成为迫切需求。本研究介绍了一款名为AftrRAD的生物信息学分析流程,其可高效组装RADseq数据并完成基因分型,同时将数据以多种格式输出以供后续分析使用。本研究通过模拟数据集与实验数据集,评估AftrRAD对基因座进行精准从头组装的能力,并将其性能与两款常用软件stacks和pyrad进行对比。研究结果表明,相较于现有同类工具,AftrRAD在兼顾等位基因间的插入缺失(insertion-deletion, indel)变异的前提下,能够以更高的计算效率完成基因座的精准组装。AftrRAD的运行时长不受数据集中样本数量的显著影响,因此在无法借助多核系统开展并行计算,或是数据集包含大量样本的场景中,该工具均具备极佳的实用价值。
创建时间:
2015-01-29
二维码
社区交流群
二维码
科研交流群
商业服务