ViroSeek: a viral detection pipeline for second-generation sequencing

Source: NIAID Data Ecosystem (indexed 2026-05-10)
Download link:
https://www.ncbi.nlm.nih.gov/sra/ERP176786
Description:
Viruses are a major public health concern, and their emergence is exacerbated by climate change and globalisation. Virome analysis is therefore becoming an important tool for monitoring and managing infectious diseases, but it remains technically challenging. Several pipelines exist (e.g. VirSorter2, VIBRANT, PIMGAVir), but their complexity, numerous options and technical requirements limit their accessibility. In this context, we present ViroSeek, a lightweight, reproducible and accessible bioinformatics pipeline designed specifically for the taxonomic analysis of second-generation sequencing data. ViroSeek performs a series of automated steps: quality control (FastQC), sequence trimming (TrimGalore), elimination of non-viral sequences (BBduk), assembly (SPAdes), taxonomic assignment (Diamond and Taxonkit), relative quantification by remapping reads (minimap2) and removal of PCR duplicates (Samtools). The whole process is designed to produce a clear, usable viral taxonomy table suitable for diversity studies. ViroSeek was empirically validated on enriched control samples containing a known panel of viruses: all expected viruses were correctly detected, and bacterial and host contaminant sequences were effectively removed. One case of weak cross-contamination, introduced during library preparation, was also detected, confirming the sensitivity of the pipeline. The pipeline is freely available and fully documented, supporting its adoption and adaptation by the research community.
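The relative quantification step described above can be illustrated with a minimal sketch. It assumes that, after remapping and duplicate removal, each contig has a count of mapped reads and a taxonomic label, and that relative abundance is simply a taxon's share of all mapped reads. The function name, the input dictionaries and the normalisation are hypothetical illustrations, not ViroSeek's documented implementation.

```python
from collections import Counter


def relative_abundance(contig_read_counts, contig_taxon):
    """Sum read counts per taxon and normalise them to fractions of the total.

    Assumption: counts are deduplicated reads remapped to each contig;
    contigs without a taxonomic assignment are grouped as "unassigned".
    """
    per_taxon = Counter()
    for contig, n_reads in contig_read_counts.items():
        taxon = contig_taxon.get(contig, "unassigned")
        per_taxon[taxon] += n_reads

    total = sum(per_taxon.values())
    if total == 0:
        return {}  # nothing mapped, so no abundances to report
    return {taxon: n / total for taxon, n in per_taxon.items()}


# Hypothetical example data (not taken from the dataset).
counts = {"contig_1": 600, "contig_2": 300, "contig_3": 100}
taxa = {"contig_1": "Virus A", "contig_2": "Virus A", "contig_3": "Virus B"}

table = relative_abundance(counts, taxa)
for taxon, fraction in sorted(table.items()):
    print(f"{taxon}\t{fraction:.2f}")
```

A table of this shape, with one row per taxon and one abundance value per sample, is the kind of input that diversity analyses typically expect.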
Created:
2026-03-02