Diamond NCBI Genbank Viral database for SOVAP
收藏NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/7697519
下载链接
链接失效反馈官方服务:
资源简介:
Diamond NCBI Genbank Viral database
Database type: Diamond database
Database format version: 3
Label: 2023-03-18_18-40-17
Sequences: 3,191,190
Sum length: 824,564,244
Assembly summary entries: 58,201
--------------------------------------------------------
SOVAP v.1.3: GitHub
Soil Virome Analysis Pipeline
Description
The study of viral communities in complex environmental samples, such as soil, can provide valuable insights into the diversity and functions of viral communities in the ecosystem. However, processing and analyzing of virome data can be a challenging task that requires the integration of various computational tools and techniques.
To address these challenges, we have developed SOVAP pipeline that utilizes a suite of state-of-the-art tools for processing, analysis, and annotation viromics and metagenomics data.
It utilizes various tools such as Fastp and Centrifuge for preprocessing and contamination removal, geNomad, Diamond and Megan for identification and annotation of viral contigs which are assembled and clustered using Megahit and CD-HIT. Additionally, this pipeline provides an estimate of the abundance of viral contigs, allowing for a more comprehensive understanding of the virome within the sample. The integration of these tools offers a reliable and effective means of taxonomy classification and annotation of viral contigs, aiding researchers in gaining insight into the composition and function of the virome within the analyzed sample.
By integrating the SOVAP pipeline with IMG/VR and geNomad, it is possible to identify a wider range of viruses, including those that were previously unknown.
The batch-mode script allows for the processing of multiple datasets using the SOVAP pipeline. This feature is particularly useful for large-scale analyses, such as those involving multiple environmental samples or large sequencing datasets.
创建时间:
2023-03-22



