Future Sequon Finder source code.
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://figshare.com/articles/dataset/Future_Sequon_Finder_source_code_/29915612
下载链接
链接失效反馈官方服务:
资源简介:
Influenza viruses are known to evade host immune responses by shielding vulnerable surface protein epitopes via N-linked glycosylation. A program titled Future Sequon Finder was developed to predict the locations in which glycan binding sites are most likely to emerge in future influenza hemagglutinin proteins. The predictive modeling approach considers how closely sites in currently circulating strains resemble glycosylation sequons at the nucleic acid level, the surface accessibility of those sites, and the mutation frequency of amino acids at those sites that would need to change to form a glycosylation sequon. The efficacy of this model is tested using historic human H1N1 and H3N2 influenza strains along with swine H1N1 strains. Through this analysis, it is revealed that glycosylation addition events in influenza hemagglutinin proteins are typically the result of single nucleotide mutation events. It is also demonstrated that site-specific mutation frequency and surface accessibility are powerful predictors of which sites will become glycosylated in human influenza viruses when considered with the genetic composition of the sites in question. Having been designed to incorporate these factors, the program successfully predicted almost every historic sequon addition event (28/30 in human IFVs, 14/15 in swine IFVs). For human strains, it also ranked the correct near-sequons highly among falsely predicted sequons based on site-specific mutation frequency. After demonstrating the model’s power with historical data, the program was used to predict future HA glycosylation sequon locations based on currently circulating human influenza viruses.
创建时间:
2025-08-14



