Protein-protein interactions decoys datasets for machine learning algorithm development
Source: DataCite Commons. Updated: 2022-06-30. Indexed: 2024-07-13.
Download link:
https://repository.kaust.edu.sa/handle/10754/666961
Description:
This is the most complete and diverse set of protein docking decoys derived from Benchmark5 and Scorers_set. We used three different rigid-body docking programs to generate the decoys for Benchmark5. We analyzed all docking decoys with more than 150 scoring functions from different sources (CCharPPI, FreeSASA, CIPS, CONSRANK). We provide a balanced and an unbalanced version of the data. The balanced data is intended for training and testing machine learning algorithms. The unbalanced data is provided to simulate the real-world scenario. We also provide a set of rigid-body docking decoys from Interactome3D that spans 1391 interactions. We obtained the labels for this set using a weakly supervised approach that we call hAIkal. We used this data to augment the training data and improve machine learning classifiers.
Provider:
KAUST Research Repository
Created:
2021-01-20
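
The description distinguishes a balanced version of the data from an unbalanced one but does not say how the balanced version was built. As a rough illustration only, the sketch below shows random undersampling, one common way to balance a labeled decoy set. It is not the authors' procedure, and the record layout (`decoy_id`, `score`, `label`) and the function name `balance_by_undersampling` are hypothetical, since the listing does not document the file format.

```python
import random
from collections import defaultdict


def balance_by_undersampling(records, label_key="label", seed=0):
    """Randomly undersample every class to the size of the rarest class.

    `records` is a list of dicts; `label_key` names the class field.
    Both are assumptions made for this sketch, not the dataset's schema.
    """
    by_label = defaultdict(list)
    for rec in records:
        by_label[rec[label_key]].append(rec)
    if not by_label:
        return []

    # Every class is cut down to the size of the smallest one.
    target = min(len(group) for group in by_label.values())
    rng = random.Random(seed)  # fixed seed keeps the subset reproducible

    balanced = []
    for group in by_label.values():
        balanced.extend(rng.sample(group, target))
    rng.shuffle(balanced)
    return balanced


# Toy, made-up decoys: 2 near-native (label 1) and 8 incorrect (label 0).
decoys = [
    {"decoy_id": i, "score": 0.1 * i, "label": 1 if i < 2 else 0}
    for i in range(10)
]
balanced = balance_by_undersampling(decoys)
counts = {lab: sum(r["label"] == lab for r in balanced) for lab in (0, 1)}
print(counts)  # {0: 2, 1: 2}
```

Undersampling discards most incorrect decoys, which is why an unbalanced copy remains useful for estimating how a classifier would behave on realistic docking output.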



