Sequence Data
收藏DataCite Commons2025-06-01 更新2024-07-28 收录
下载链接:
https://figshare.com/articles/Sequence_Data/12408653/2
下载链接
链接失效反馈官方服务:
资源简介:
Over the years, several methods have been proposed for the computational PPI prediction with different performance evaluation strategies. While attempting to benchmark performance scores, most of these methods often suffer with ill-treated cross-validation strategies, adhoc selection of positive/negative samples etc. To address these issues, in our proposed multi-level feature based PPI prediction approach <i>JUPPI</i>, using sequence, domain and GO information as features, a refined evaluation strategy has been introduced. During the evaluation process, we first extract high quality negative data using three-stage filtering, and then introduce a pair-input based cross validation strategy with three difficulty levels for test-set predictions. Our proposed evaluation strategy reduces the component-level overlapping issue in test sets. Performance of <i>JUPPI</i> is compared with those of the <i>state-of-the-art</i> approaches in this domain and tested on six independent PPI datasets. In almost all the datasets, <i>JUPPI</i> outperforms the <i>state-of-the-art</i> not only at human proteome level for PPI prediction, but also for prediction of interactors for intrinsic disordered human proteins.
提供机构:
figshare
创建时间:
2020-06-03



