five

Extensive Alignment Dataset of COVID-19 Gene Primers and Probes Across SARS-CoV-2 Variants

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://figshare.com/articles/dataset/Extensive_Alignment_Dataset_of_COVID-19_Gene_Primers_and_Probes_Across_SARS-CoV-2_Variants/26530669
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains a comprehensive analysis of COVID-19 genetic sequences focused on four key genes: Spike Glycoprotein, Envelope Protein, Nucleocapsid Protein, and 3' UTR(3' untranslated region). It comprises 20 Excel files, each holding 100,000 samples. A Python script was employed to evaluate primer sets using local alignment against sequences from the NCBI database, with lineage determination via the Pangolin tool. Each Excel file contains four sheets, one per gene, with columns for accession ID, sample name, primer sequences, alignment metrics, and lineage. The dataset includes primer analysis for 2 million sequences across all genes and probe analysis for 1 million sequences in a separate set of 10 Excel files. The dataset contains an additional Excel file that contains the count of lineage samples to which the primer and the probes were aligned.
创建时间:
2025-08-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作