
Global SARS-CoV-2 Proteome Mutation Burden Atlas Derived From 103 Million Amino Acid Sequences Covering S, N, M, E, ORF1a, ORF1b, ORF3a, ORF6, ORF7a, ORF7b, and ORF8 Proteins Spanning 2020 to Q3-2025

NIAID Data Ecosystem, indexed 2026-05-10
Download link:
https://figshare.com/articles/dataset/Global_SARS-CoV-2_Proteome_Mutation_Burden_Atlas_Derived_From_103_Million_Amino_Acid_Sequences_Covering_S_N_M_E_ORF1a_ORF1b_ORF3a_ORF6_ORF7a_ORF7b_and_ORF8_Proteins_Spanning_2020_to_Q3-2025/30655373
Description:
This dataset presents the largest known per-protein amino acid mutation burden analysis of SARS-CoV-2 to date, encompassing 103,378,188 high-quality translated sequences from global surveillance. For each of the 11 canonical SARS-CoV-2 proteins (structural, replicase, and accessory), we provide:

- Per-sequence mutation burden vs. the Wuhan-Hu-1 reference
- Year-stratified evolutionary statistics (where metadata are available)
- Chunked, analysis-ready records in JSON.zst and TSV.zst formats

SARS-CoV-2 Mutation Burden Summary
==================================================
S: 9398268 seqs, mean burden = 113.66
N: 9397174 seqs, mean burden = 37.83
M: 9398092 seqs, mean burden = 12.10
E: 9398041 seqs, mean burden = 2.24
ORF1a: 9398286 seqs, mean burden = 178.16
ORF1b: 9398287 seqs, mean burden = 6940.96
ORF3a: 9398086 seqs, mean burden = 5.92
ORF6: 9397993 seqs, mean burden = 1.04
ORF7a: 9397998 seqs, mean burden = 7.06
ORF7b: 9397956 seqs, mean burden = 2.47
ORF8: 9397712 seqs, mean burden = 4.93

Note: The ORF1b burden reflects an alignment artifact due to frameshift-dependent expression; its values are not biologically comparable to those of other genes.

Total Sequences: 103,378,188

Please cite this dataset properly if you use it. It supports my upcoming journal article. Processing originally began for 2025 only and was later expanded to cover 2020 through 2025, so although a few folders are labeled 2025, all of them span 2020 through the third quarter of 2025.

Study & Data Processed by: TahirHB@Hotmail.Com
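
The per-protein summary in the description follows a regular `NAME: COUNT seqs, mean burden = VALUE` pattern, so it can be loaded into a small lookup table for downstream checks. The following is a minimal sketch, not the author's pipeline: the names `SUMMARY`, `parse_summary`, `comparable`, and `NOT_COMPARABLE` are hypothetical, and the only data used are the summary lines quoted from the description. It does not read the dataset's JSON.zst or TSV.zst chunks, whose schema is not documented here.

```python
import re

# Summary lines copied verbatim from the dataset description.
SUMMARY = """\
S: 9398268 seqs, mean burden = 113.66
N: 9397174 seqs, mean burden = 37.83
M: 9398092 seqs, mean burden = 12.10
E: 9398041 seqs, mean burden = 2.24
ORF1a: 9398286 seqs, mean burden = 178.16
ORF1b: 9398287 seqs, mean burden = 6940.96
ORF3a: 9398086 seqs, mean burden = 5.92
ORF6: 9397993 seqs, mean burden = 1.04
ORF7a: 9397998 seqs, mean burden = 7.06
ORF7b: 9397956 seqs, mean burden = 2.47
ORF8: 9397712 seqs, mean burden = 4.93
"""

# One match per line: protein name, sequence count, mean burden.
LINE_RE = re.compile(
    r"^(\w+):\s*(\d+)\s+seqs,\s*mean burden\s*=\s*([\d.]+)$",
    re.MULTILINE,
)

# Per the description's note, ORF1b reflects an alignment artifact and is
# not comparable to the other proteins (assumption: exclude it by name).
NOT_COMPARABLE = {"ORF1b"}


def parse_summary(text):
    """Return {protein: (sequence_count, mean_burden)} from summary text."""
    return {
        name: (int(count), float(mean))
        for name, count, mean in LINE_RE.findall(text)
    }


def comparable(summary):
    """Drop proteins whose burden the description flags as non-comparable."""
    return {k: v for k, v in summary.items() if k not in NOT_COMPARABLE}


if __name__ == "__main__":
    summary = parse_summary(SUMMARY)
    for protein, (count, mean) in comparable(summary).items():
        print(f"{protein}\t{count}\t{mean:.2f}")
```

Keeping the ORF1b exclusion in a named set makes the caveat from the description explicit in any ranking or comparison built on these means.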
Created:
2025-11-19