five

Extracting Residue Solvent Exposure from Covalent Labeling Data with Machine Learning: A Hybrid Approach for Protein Structure Prediction

收藏
Figshare2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Extracting_Residue_Solvent_Exposure_from_Covalent_Labeling_Data_with_Machine_Learning_A_Hybrid_Approach_for_Protein_Structure_Prediction/29115275
下载链接
链接失效反馈
官方服务:
资源简介:
Hydroxyl radical protein footprinting (HRPF) coupled with mass spectrometry yields information about residue solvent exposure and protein topology. However, data from these experiments are sparse and require computational interpretation to generate useful structural insight. We previously implemented a Rosetta algorithm that uses experimental HRPF data to improve protein structure prediction. Modern structure prediction methods, such as AlphaFold2 (AF2), use machine learning (ML) to generate their predictions. Implementation of an HRPF-guided version of AF2 is challenging due to the substantial amount of training data required and the inherently abstract nature of ML networks. Thus, here we present a hybrid method that uses a light gradient boosting machine to predict residue solvent accessibility from experimental HRPF data. These predictions were subsequently used to improve Rosetta structure prediction. Our hybrid approach identified models with atomic-level detail for all four proteins in our benchmark set. These results illustrate that it is possible to successfully use ML in combination with HRPF data to accurately predict protein structures.
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作