Training data for "Identification of allelic variants in SARS-CoV-2 from deep sequencing reads"

NIAID Data Ecosystem2026-03-12 收录

下载链接：

https://zenodo.org/record/5036686

下载链接

链接失效反馈

官方服务：

资源简介：

Effectively monitoring global infectious disease crises, such as the COVID-19 pandemic, requires capacity to generate and analyze large volumes of sequencing data in near real time. These data have proven essential for monitoring the emergence and spread of new variants, and for understanding the evolutionary dynamics of the virus. Two sequencing platforms in combination with several established library preparation strategies are predominantly used to generate SARS-CoV-2 sequence data. However, data alone do not equal knowledge: they need to be analyzed. The Galaxy community developed analysis workflows to support the identification of allelic variants (AVs) in SARS-CoV-2 from deep sequencing reads. These workflows allow one to identify AVs and lineages in SARS-CoV-2 genomes with variant allele frequencies ranging from 5% to 100% (i.e., they detect variants with intermediate frequencies as well. In this tutorial we will see how to run these workflows for the different types of input data: Single end data derived from Illumina-based RNAseq experiments Paired end data derived from Illumina-based RNAseq experiments Paired-end data generated with Illumina-based Ampliconic (ARTIC) protocols ONT fastq files generated with Oxford nanopore (ONT)-based Ampliconic (ARTIC) protocols To illustrate the tutorial, we took some example datasets (paired-end data generated with Illumina-based Ampliconic (ARTIC) protocols) from COG-UK, the COVID-19 Genomics UK Consortium.

创建时间：

2021-06-28