five

NRPStransformer, an Accurate Adenylation Domain Specificity Prediction Algorithm for Genome Mining of Nonribosomal Peptides

收藏
Figshare2025-08-25 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/NRPStransformer_an_Accurate_Adenylation_Domain_Specificity_Prediction_Algorithm_for_Genome_Mining_of_Nonribosomal_Peptides/29984110
下载链接
链接失效反馈
官方服务:
资源简介:
Nonribosomal peptides serve as pivotal sources for drug discovery. Accurate prediction of the substrate specificity of adenylation domains in nonribosomal peptide synthetases is crucial for genome mining of nonribosomal peptides, yet current prediction methods fall short in accuracy. In this work, we analyzed 4,100 adenylation domains from documented nonribosomal peptide synthetases and found that the flavodoxin-like subdomain universally governs substrate specificity in all bacterial adenylation domains and that its phylogenetic analysis can correlate the sequences of adenylation domains and their substrate specificity. Leveraging the sequences within the flavodoxin-like subdomain, we developed a substrate specificity prediction algorithm using a protein language model, achieving 92% overall prediction accuracy for 43 frequently observed amino acids, significantly improving the prediction reliability. The efficacy of our prediction tool was validated through targeted genome mining, which led to the discovery of novel antimicrobial peptides. Our work lays a foundation to understand the sequence-to-function relationship of the bacterial adenylation domain and will facilitate the exploitation of nonribosomal peptides. NRPStransformer is available at http://www.nrpstransformer.cn.
创建时间:
2025-08-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作