UnCoVar: A Reproducible and Scalable Workflow for Transparent and Robust Virus Variant Calling and Lineage Assignment using SARS-CoV-2 as an example
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://www.ncbi.nlm.nih.gov/sra/ERP158350
下载链接
链接失效反馈官方服务:
资源简介:
At a global scale, the SARS-CoV-2 virus did not remain in its initial genotype for a long period of time, with the first global reports of variants of concerns (VOCs) in late 2020. Subsequently, genome sequencing became an indispensable tool to characterize the ongoing pandemic, particularly for typing SARS-CoV-2 samples obtained from patients or environmental surveillance. For such SARS-CoV-2 typing, various in-vitro and in-silico workflows exist, yet to date no systematic cross-platform validation attempt is known to the authors. In this work, we present the first comprehensive cross-platform evaluation and validation of in-silico SARS-CoV-2 typing workflows. The evaluation relies on a dataset of 54 patient-derived samples sequenced with several different in-vitro approaches on all relevant state-of-the-art sequencing platforms. Moreover, we present UnCoVar, a robust, production-grade reproducible SARS-CoV-2 typing workflow that outperforms all other tested approaches in terms of precision and recall. In many ways, the SARS-CoV-2 pandemic served as an accelerator for the development of techniques and analytical approaches. We believe that this can serve as a blueprint for dealing with future pandemics. Accordingly, UnCoVar is easily generalizable towards other viral pathogens and future pandemics. The fully automated workflow assembles virus genomes from patient samples, identifies existing lineages, and provides high resolution insights into individual mutations. UnCoVar includes extensive quality control and automatically generates interactive visual reports. UnCoVar is implemented as a Snakemake workflow. The open-source code is available under a BSD 2-clause license at github.com/IKIM-Essen/uncovar.
创建时间:
2024-07-20



