Extensive Alignment Dataset of COVID-19 Gene Primers and Probes Across SARS-CoV-2 Variants
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://figshare.com/articles/dataset/Extensive_Alignment_Dataset_of_COVID-19_Gene_Primers_and_Probes_Across_SARS-CoV-2_Variants/26530669
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains a comprehensive analysis of COVID-19 genetic sequences focused on four key genes: Spike Glycoprotein, Envelope Protein, Nucleocapsid Protein, and 3' UTR(3' untranslated region). It comprises 20 Excel files, each holding 100,000 samples. A Python script was employed to evaluate primer sets using local alignment against sequences from the NCBI database, with lineage determination via the Pangolin tool. Each Excel file contains four sheets, one per gene, with columns for accession ID, sample name, primer sequences, alignment metrics, and lineage. The dataset includes primer analysis for 2 million sequences across all genes and probe analysis for 1 million sequences in a separate set of 10 Excel files. The dataset contains an additional Excel file that contains the count of lineage samples to which the primer and the probes were aligned.
创建时间:
2025-08-20



