Replication Data for: Pre-pandemic artificial MERS analog of polyfunctional SARS-CoV-2 S1/S2 furin cleavage site domain is unique among spike proteins of genus Betacoronavirus
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://doi.org/10.7910/DVN/GBGPKY
下载链接
链接失效反馈官方服务:
资源简介:
Representative set of 2,465 betacoronavirus S protein overlapping homologous superfamily sequences was retreived in fasta format on 4 December 2022 from the InterPro repository at https://www.ebi.ac.uk/interpro/entry/InterPro/IPR042578/. From these sequences were extracted 98,122 furin cleavage site (FCS) motifs of 20 amino acid length, including overlapping sequences, using the FindFur algorithm as described by (Gu, 2020) and deposited on 15 December 2020 at the GitHub software repository at https://github.com/chwisteeng/FindFur. These sequences were individually checked for The/Ser O-glycosite residue pairs with the standard prediction software NetOGlyc4.0 (Steentoft et al., 2013) as available at https://services.healthtech.dtu.dk/services/NetOGlyc-4.0/. The bioinformatics nuclear localization signal prediction, including the positive predictions for pat7 in SARS-CoV-2 and in MERS_MA30 CoV, was performed by the PSORT algorithm available as a webservice at https://wolfpsort.hgc.jp/ which is based on the work of Nakai and Horton (Nakai and Horton, 1999). Comprehensive sequence database searches using were performed using the NCBI protein BLAST (BLASTP) algorithm with webservice available at https://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE=Proteins; the following BLASTP search parameters and settings were used: Word size=2; Expect value=200000; Hitlist size=500; Gapcosts=9,1; Matrix=PAM30; Filter string= F; Genetic Code=1;Window Size=40; Threshold=11; Composition-based stats=0; Database Posted date=Jan 19, 2023 2:59 AM; Number of letters=17,117,563; Number of sequences=10,766; Entrez query: Includes: Be-tacoronavirus (taxid:694002) Excludes: SARS-CoV-2 (taxid:2697049). The pat7 input query consensus motif sequences were TXXPR(K/H/R)XRSX and TXXPRX(K/H/R)RSX. The resulting text output of this sequence search was compiled and deposited in Data File 1
创建时间:
2024-08-01



