Average run time and memory usage to align 100 query samples and to reconstruct ancestral sequence for the SARS2-100k dataset.

NIAID Data Ecosystem2026-05-01 收录

下载链接：

https://figshare.com/articles/dataset/Average_run_time_and_memory_usage_to_align_100_query_samples_and_to_reconstruct_ancestral_sequence_for_the_SARS2-100k_dataset_/25192733

下载链接

链接失效反馈

官方服务：

资源简介：

We estimated the run time and peak memory usage for the two preprocessing steps by TIPars. The first step of query sample alignment is necessary for all other alignment-based phylogenetic placement methods, e.g., UShER, EPA-ng, IQTREE-2 and APPLES-2. For the second step, once ancestral sequences have been reconstructed, they can be reused. Here we performed 10 runs of these two preprocessing steps: 1) aligning 100 SARS-CoV-2 samples to the reference hCoV-19/Wuhan/WIV04/2019|EPI_ISL_402124 using MAFFT (—addtotop); 2) inferring the ancestral sequences for the SARS2-100k dataset using PastML (ACCTRAN method) and present average times. Tests were run on a server with 64 or 8 Intel Xeon Gold 6242 CPU cores. (XLSX)

本研究通过TIPars评估了两项预处理步骤的运行时长与峰值内存占用。查询样本比对（query sample alignment）作为第一步，是所有其他基于比对的系统发育放置（phylogenetic placement）方法（如UShER、EPA-ng、IQTREE-2与APPLES-2）的必要前置步骤。至于第二步，祖先序列经重构后即可重复复用。本次实验共对两项预处理步骤执行10轮完整运行：1）使用MAFFT（参数--addtotop）将100株严重急性呼吸综合征冠状病毒2（SARS-CoV-2）样本比对至参考序列hCoV-19/Wuhan/WIV04/2019|EPI_ISL_402124；2）使用PastML的ACCTRAN法为SARS2-100k数据集推断祖先序列，并给出平均运行时长。所有测试均在搭载64核或8核Intel Xeon Gold 6242 CPU的服务器上完成。 (XLSX)

创建时间：

2024-02-08