five

Predicting Metalloprotein Redox Potentials with Machine Learning: A Focus on Iron–Sulfur Systems

收藏
Figshare2025-10-30 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Predicting_Metalloprotein_Redox_Potentials_with_Machine_Learning_A_Focus_on_Iron_Sulfur_Systems/30490128
下载链接
链接失效反馈
官方服务:
资源简介:
Iron–Sulfur (Fe–S) proteins play essential roles in a wide range of biological processes, from energy conversion and respiration to DNA repair and redox signaling, making them highly relevant to both bioenergetics and human health. These proteins mediate electron transfer through finely tuned reduction potentials (RP) defined by their metal cofactors. However, predicting RP from protein structures remains a significant challenge due to the complex electronic nature of Fe–S clusters and their intricate coupling with the surrounding protein environment. This complexity limits our ability to systematically modulate RP, hindering efforts in high-throughput and rational protein design. In this study, we introduce a Machine Learning (ML) framework, FeS-RedPred, for accurate and scalable prediction of RP in Fe–S proteins. We focus on mono- and binuclear clusters, such as rubredoxins and [2Fe–2S] clusters of ferredoxins, Rieske, and mitoNEET-type, which serve as ideal model systems thanks to the availability of abundant structural and electrochemical data. Our approach relies on structure-derived molecular descriptors computed across multiple spatial scales, from local atomic environments to global protein-level features. Using Extreme Gradient Boosting (XGB) models, we achieve a mean absolute error of ∼40 mV, which is competitive with state-of-the-art computational approaches, while also providing a highly efficient compromise between accuracy and computational cost. Beyond predictive accuracy, our model also offers indications about the determinants of RP, enabling a basis for interpretation and potentially guiding protein engineering. This work provides a valuable foundation for understanding the redox behavior of metalloproteins, enabling the high-throughput prediction of redox potentials and informing data-driven design across diverse protein families.
创建时间:
2025-10-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作