
A Nextflow-Based Automated Pipeline for Viral Assembly and Characterisation (EVEREST)

Source: Figshare. Updated: 2025-03-07. Indexed: 2026-04-28.
Download link:
https://figshare.com/articles/dataset/A_Nextflow-Based_Automated_Pipeline_for_Viral_Assembly_and_Characterisation_EVEREST_/28553732
Description:
EVEREST (pipEline for Viral assEmbly and chaRactEriSaTion) is an end-to-end pipeline for virus discovery and characterisation. Implemented in Nextflow, it processes Illumina single- and paired-end reads through five phases: pre-processing, filtering, de novo assembly, refinement, and classification. To ensure high-quality input, the pipeline trims reads, removes host sequences, eliminates duplicates, and applies digital normalization. It then assembles viral genomes de novo, clusters similar contigs, captures viral genomes, and assesses their quality. Finally, EVEREST classifies viral contigs against the NCBI (nucleotide) and UniProt (amino acid) databases, providing a framework for identifying and characterising viruses from sequencing data.
Created: 2025-03-07
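
The description mentions duplicate elimination followed by digital normalization but does not say which tools EVEREST uses for either step. As a rough illustration only, the sketch below shows one common way these two ideas can be expressed: exact-duplicate removal with a set, then normalization by median k-mer abundance (a read is kept only while the median count of its k-mers is still below a cutoff). The function names `kmers` and `preprocess`, and the parameters `k` and `cutoff`, are hypothetical and are not taken from the pipeline.

```python
from collections import Counter
from statistics import median


def kmers(seq, k):
    """Return all overlapping k-mers of a sequence."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def preprocess(reads, k=4, cutoff=2):
    """Drop exact duplicates, then apply median k-mer abundance normalization.

    Illustrative only: EVEREST's actual tools and thresholds are not
    stated in the description, so `k` and `cutoff` are assumed values.
    """
    seen = set()        # reads already encountered (exact-match dedup)
    counts = Counter()  # k-mer abundances over the reads kept so far
    kept = []
    for read in reads:
        if read in seen:
            continue    # exact duplicate: skip
        seen.add(read)
        read_kmers = kmers(read, k)
        if not read_kmers:
            kept.append(read)  # shorter than k: no k-mers to judge by, keep
            continue
        # Keep the read only if its k-mers are still under-represented.
        if median(counts[km] for km in read_kmers) < cutoff:
            kept.append(read)
            counts.update(read_kmers)
    return kept


if __name__ == "__main__":
    reads = ["ACGTACGT", "ACGTACGT", "ACGTACGA", "ACGTACGC", "TTTTGGGG"]
    print(preprocess(reads))
```

With these toy reads, the second read is dropped as an exact duplicate and the fourth is dropped because its k-mers are already covered at the cutoff, while the unrelated fifth read is retained. Real pipelines operate on FASTQ records and use memory-efficient k-mer counting rather than an in-memory `Counter`.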