Comparative Genomic Analysis of Coronaviruses Across 2003–2025: Insights into CpG Dynamics and Mutation Profiles Relative to Wuhan-Hu-1 (NC_045512)
收藏Figshare2025-04-01 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Comparative_Genomic_Analysis_of_Coronaviruses_Across_2003_2025_Insights_into_CpG_Dynamics_and_Mutation_Profiles_Relative_to_Wuhan-Hu-1_NC_045512_/28711058
下载链接
链接失效反馈官方服务:
资源简介:
This project analyzes the genomic features of coronaviruses, focusing on CpG dynamics and mutation profiles relative to the reference genome Wuhan-Hu-1 (NC_045512). Key findings include:CpG Analysis : Sliding window analysis was performed to calculate CpG counts and observed/expected (O/E) ratios across the genomes of multiple coronaviruses.Mutation Profiles : Sequences were aligned using MAFFT, and mutations relative to Wuhan-Hu-1 were identified.Visualization : Bar charts and line plots were generated to summarize CpG counts, O/E ratios, and mutation distributions.Dataset Includes:Combined FASTA file (`combined_sequences.fasta`) containing sequences from 11 coronaviruses.Aligned sequences (`aligned_sequences.fasta`) generated using MAFFT.Mutation analysis results (`mutations_analysis.txt`) showing mutations relative to Wuhan-Hu-1.- Visualizations:Bar chart showing the number of mutations per genome.Scripts used for converting GenBank files to FASTA, aligning sequences, identifying mutations, and generating visualizations.Input GenBank files.Methods:1. Converted GenBank files to FASTA format using Biopython.2. Aligned sequences using MAFFT.3. Identified mutations relative to Wuhan-Hu-1 using a custom Python script.4. Generated visualizations using Matplotlib.Implications:This work contributes to understanding the genomic architecture and evolutionary relationships of coronaviruses. The findings align with broader trends observed in coronavirus genomics, highlighting shared mechanisms of CpG depletion and mutation patterns.Published on Figshare:
创建时间:
2025-04-01



