five

Mutation Frequency Profiling and Evolutionary Dynamics of the SARS-CoV-2 Spike Gene Across 2.8 Million Randomly Sampled Genomes

收藏
Figshare2025-08-06 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Mutation_Frequency_Profiling_and_Evolutionary_Dynamics_of_the_SARS-CoV-2_Spike_Gene_Across_2_8_Million_Randomly_Sampled_Genomes/29847212
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset presents a comprehensive analysis of mutation variability across key functional regions of the SARS-CoV-2 Spike (S) gene, derived from over 2.8 million full-length high quality genome sequences. Using the Wuhan-Hu-1 reference (NC_045512.2), we identified mutation frequencies and average mutation rates per amino acid for critical regions including the Receptor-Binding Domain (RBD), N-terminal Domain (NTD), Furin Cleavage Site, and others. Results highlight the dynamic evolution of the Spike protein, with the NTD and RBD exhibiting the highest mutation pressures, reflecting their roles in immune evasion and receptor binding. Chart attached explains mutation frequency across the SARS-CoV-2 Spike gene, showing highest variability in the N-terminal Domain (NTD) and Receptor-Binding Domain (RBD). Analysis based on 2.8 million genomes aligned to Wuhan-Hu-1 (NC_045512.2). This resource enables researchers to track evolutionary trends, prioritize surveillance targets, and support vaccine and therapeutic development.

本数据集针对严重急性呼吸综合征冠状病毒2型(SARS-CoV-2)刺突(Spike, S)基因的关键功能区域突变变异特征开展了全面分析,数据源自超过280万条高质量全长基因组序列。本研究以武汉-Hu-1参考株(NC_045512.2)作为参照序列,对受体结合域(Receptor-Binding Domain, RBD)、N端结构域(N-terminal Domain, NTD)、弗林酶切位点等关键区域的各氨基酸位点,鉴定了其突变频率与平均突变率。研究结果揭示了刺突蛋白的动态演化规律:N端结构域与受体结合域承受着最高的突变压力,这与其在免疫逃逸与病毒受体结合过程中的核心功能高度契合。随附图表展示了SARS-CoV-2刺突基因的突变频率分布,可见N端结构域与受体结合域的变异程度最为显著。本分析基于280万条比对至武汉-Hu-1参考株(NC_045512.2)的基因组序列完成。本数据集可为研究人员追踪病毒演化趋势、确定监测优先靶点、支持疫苗与治疗药物开发提供重要支撑。
创建时间:
2025-08-06
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作