five

Comprehensive Global Genomic Surveillance of SARS-CoV-2 Structural Gene Mutations (S, E, M, N): High-Resolution Profiling from 2021 to 2024

收藏
DataONE2025-08-09 更新2025-11-01 收录
下载链接:
https://search.dataone.org/view/sha256:6a75357e2c36058560dd9b1991d70d37b63f56a5d73d097671829d48f62c0e67
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains amino acid mutation profiles for the SARS-CoV-2 structural genes (S, E, M, N) extracted from publicly available whole-genome sequences. Mutations are listed per sample with metadata including SampleID, country, and year where available. The data reflects mutation frequencies observed across global lineages up to 2025. Files included: Gene-specific mutation lists per sample. S_mutations.csv, E_mutations.csv, M_mutations.csv, N_mutations.csv Top 100 observed mutation patterns in the Spike (S) gene. S_top100_constellations.csv Data was processed from GISAID and GenBank sequences, aligned to Wuhan-Hu-1 (NC_045512.2), and filtered for full-length gene coverage. This resource supports studies of viral evolution, variant tracking, and genomic epidemiology.

本数据集包含从公开可用全基因组序列中提取的严重急性呼吸综合征冠状病毒2型(SARS-CoV-2)结构基因——刺突蛋白(S)、包膜蛋白(E)、膜蛋白(M)、核衣壳蛋白(N)的氨基酸突变谱。每份样本的突变信息均附带可用元数据,包括样本编号(SampleID)、采样国家及采样年份。该数据集反映了截至2025年全球各SARS-CoV-2进化分支中观测到的突变频率。 包含的文件如下: 1. 针对每份样本的基因特异性突变列表文件:S_mutations.csv、E_mutations.csv、M_mutations.csv、N_mutations.csv 2. 刺突蛋白(S)基因中观测到的前100种突变模式数据集:S_top100_constellations.csv 本数据集的数据源自GISAID及GenBank的序列,经比对至武汉-Hu-1参考序列(NC_045512.2)并经过全长基因覆盖度过滤处理。本资源可用于病毒进化研究、病毒变异追踪及基因组流行病学相关研究。
创建时间:
2025-10-29
二维码
社区交流群
二维码
科研交流群
商业服务