five

Nucleotide analogue tolerant synthetic RdRp mutant construct for Surveillance and Therapeutic Resistance Monitoring in SARS-CoV-2

收藏
DataCite Commons2025-06-29 更新2025-09-08 收录
下载链接:
https://figshare.com/articles/dataset/Nucleotide_analogue_tolerant_synthetic_RdRp_mutant_construct_for_Surveillance_and_Therapeutic_Resistance_Monitoring_in_SARS-CoV-2/29316056
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset comprises a synthetic construct specifically designed to facilitate the development of primers and probes capable of detecting emerging resistant strains of SARS-CoV-2. It serves as a robust reference for genomic surveillance pipelines and provides valuable insights to guide next-generation antiviral design and therapeutic strategies.The main data contributions come from GVAtlas (tahirhb.com/GVAtlas) platform. It has SARS-CoV-2 genomes processed either fully or processed for specific region of SARS-CoV-2. From that massive dataset all publicly available sequences were extracted then a representative consensus genome was built.A local custom AI model was trained on 10 Million historical genomes and data for Q1 of 2025 simulating remdesivir pressure to predict which mutations are likely to emerge under therapeutic selection.Using the output of this locally trained AI model a synthetic RdRp region was generated incorporating those predicted mutations and was inserted into the consensus backbone.This gave me a model-based remdesivir-resistant candidate genome. It is derived from real-world trends and AI-guided evolution.<b>Summary of Mutation Impacts </b>The RdRp (nsp12) mutations introduced into the consensus genome reflect evolutionary pressures simulated using AI model trained under remdesivir-like conditions. These mutations are located in functionally important regions of the polymerase and may collectively influence the enzyme's interaction with <b>nucleotide analogs</b> and its overall fidelity.The <b>P323S</b> mutation, found in motif <b>F</b>, is a known polymorphic site often linked to increased fitness under treatment pressure. It resides at the interface with nsp7 and nsp8, suggesting potential impacts on complex formation or dynamic movement during RNA synthesis.At position 680 (<b>S680N</b>) , the mutation is in proximity to the active site, and while not directly interacting with the incoming nucleotides, it may influence the structural flexibility of the enzyme, allowing the RdRp to better <b>tolerate nucleotide analogs</b> or maintain activity under drug pressure.<b>A685G</b> replaces an alanine with glycine, increasing local flexibility. While alanine is structurally rigid, glycine can induce turns or bends in secondary structures. If this change occurs within a helical region, it may alter the positioning of nearby residues, possibly reducing the affinity of remdesivir during RNA synthesis.The <b>C861K</b> and <b>N866K</b> mutations are notable for their chemical nature. Cysteine plays a crucial role in maintaining tertiary structure via disulfide bridges, and replacing it with a large, positively charged lysine could disrupt local folding or interactions. Similarly, <b>N866K</b> introduces a bulky residue in a region critical for RNA template binding or enzyme dynamics, potentially influencing elongation efficiency or fidelity .Finally, <b>R897K</b>, though a conservative substitution, could modify protein-protein interactions with cofactors or even affect host factor recruitment, indirectly influencing replication efficiency.<b>Files included:</b>3k_consensus_modified.fasta : Final mutant genomemutations_summary.csv : Mutation table for 8 genesremde_resistance_mutations.csv : RdRp-specific mutationsproteins/*.fasta : Translated protein sequences<b>Refrences:</b><b>1</b>- Chen, L., Zhang, Z., Wu, M., &amp; Xiao, J. (2021). Machine learning applications in virus-related research: A review. Briefings in Bioinformatics, 22 (1), 1 14. https://doi.org/10.1093/bib/bbaa104 <br><b>2</b>- Cock, P. J., Antao, T., Chang, J. T., Chapman, B. A., Cox, C. J., Dalke, A., &amp; de Hoon, M. J. (2009). Biopython: Freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics, 25 (11), 1422 1423. https://doi.org/10.1093/bioinformatics/btp163 <br><b>3</b>- Elbe, S., &amp; Buckland-Merrett, G. (2017). Data, disease and diplomacy: GISAID’s innovative contribution to global health. Global Challenges, 1 (1), 33 46. https://doi.org/10.1002/gch2.1018 <br><b>4</b>- GISAID Initiative. (2024). EpiCoV Database. https://www.gisaid.org <br><b>5</b>- Grubaugh, N. D., Gangavarapu, K., Quick, J., Matteson, N. L., Debroas, D. H., Moore, A. L., &amp; Andersen, K. G. (2019). An amplicon-based sequencing framework for accurately profiling genomic variation of SARS-CoV-2. Genome Biology, 21 (1), 1 13. https://doi.org/10.1186/s13059-020-02103-z <br><b>6</b>- Li, H. (2011). A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation using next-generation sequencing data. Bioinformatics, 27 (21), 2987 2993. https://doi.org/10.1093/bioinformatics/btr509 <br><b>7</b>- Peters, M. D., Godfrey, C. M., McInerney, P., Khalil, H., Parker, D., &amp; Soares, C. B. (2015). Chapter 11: Systematic reviews of complex evidence: Scoping reviews. In Joanna Briggs Institute Reviewer’s Manual (pp. 1 18). Joanna Briggs Institute.<br><b>8</b>- Tchesnokov, E. P., Feng, J. Y., Porter, D. P., &amp; Götte, M. (2020). Effect of Remdesivir on the Mutation Rate of SARS-CoV-2. Journal of Virology, 94 (21), e01458-20. https://doi.org/10.1128/JVI.01458-20 <br><b>9</b>- Wu, F., Zhao, S., Yu, B., Chen, Y. M., Wang, W., Song, J. D., &amp; Tan, W. (2020). A new coronavirus associated with human respiratory disease in China. Nature, 579 (7798), 265 269. https://doi.org/10.1038/s41586-020-2008-3 <br><b>10</b>- Zou, J., Huss, M., Abid, A., Mohammadi, P., Engreitz, J., &amp; Singh, A. (2021). A primer on deep learning in genomics. Nature Genetics, 53 (1), 1 8. https://doi.org/10.1038/s41588-020-00766-z <br><b>11</b>- Agostini, M. L., Pruijssers, A. J., Chappell, J. D., Gribble, J., Lu, X., Andres, S., &amp; Denison, M. R. (2022). Small molecule inhibitors targeting replication enzymes of RNA viruses. Annual Review of Pharmacology and Toxicology, 62 , 253 273. https://doi.org/10.1146/annurev-pharmtox-051221-114943 <br><b>12</b>- Figshare. (2024). Digital repository for research. https://figshare.com <br><b>13</b>- NCBI Resource Coordinators. (2023). GenBank database. National Center for Biotechnology Information . https://www.ncbi.nlm.nih.gov/genbank/ <br><b>Note:</b>This genome represents a candidate for surveillance and vaccine preparedness.Novel mutations include <b>P323S, A685G, R897K</b> in <b>nsp12</b> (RdRp).For any query / related data / files i can be contacted at tahirhb@hotmail.com
提供机构:
figshare
创建时间:
2025-06-29
二维码
社区交流群
二维码
科研交流群
商业服务