Contributions of the different interface features to the logistic regression models.
Source: Figshare. Updated: 2015-12-02. Indexed: 2026-04-29.
Download link:
https://figshare.com/articles/dataset/_Contributions_of_the_different_interface_features_to_the_logistic_regression_models_/253938
Description:
The 3 sequence descriptors are i) the similarity of the residue of interest to its structural equivalent in the interolog, expressed as the BLOSUM62 substitution score, ii) the average BLOSUM62 substitution score over all the residues contacting the residue of interest (its “environment”), iii) the overall minimum sequence identity with the interolog (which is somewhat correlated with the iRMSD). The 3 geometric descriptors are iv) the core/support/rim category of the residue of interest, v) the number of atomic contacts in which the residue of interest is involved, vi) the distance of the residue from the geometric center of the interface (normalized to a maximum of 1 for each interface). The logistic regression coefficients were averaged over the ten repeats. The interface category is a discrete feature, so its contribution is 0 if the residue belongs to the core (the default situation), 0.96 in the switching out predictor (resp. 0.23 in the contact conservation predictor) if the residue belongs to the support region, and 1.06 (resp. −0.21) if the residue belongs to the rim. The standard deviations correspond to the variation of the coefficients over the ten repeats of the random partition of the residues into training and test datasets. The significance of each parameter was assessed from the z-tests performed on the logistic regression coefficients: significance values were always found to be Text S1 and in Table S3 in Text S2.
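The handling of the discrete interface category and the averaging over repeats can be illustrated with a minimal Python sketch. It is not the authors' code: the function and dictionary names are hypothetical, the use of the sample standard deviation is an assumption (the description does not say which estimator was used), and only the four category contributions quoted above come from the source. The conversion from log-odds to probability is the standard logistic function.

```python
import math
from statistics import mean, stdev

# Category contributions quoted in the description; the core is the
# reference level, so it contributes 0 in both predictors.
CATEGORY_CONTRIBUTION = {
    "switching_out": {"core": 0.0, "support": 0.96, "rim": 1.06},
    "contact_conservation": {"core": 0.0, "support": 0.23, "rim": -0.21},
}


def category_contribution(predictor, category):
    """Return the additive log-odds contribution of the interface category."""
    return CATEGORY_CONTRIBUTION[predictor][category]


def summarize_coefficient(values):
    """Mean and sample standard deviation of one coefficient over repeats.

    Assumption: sample (n - 1) standard deviation; the source does not
    specify the estimator.
    """
    return mean(values), stdev(values)


def probability(log_odds):
    """Standard logistic function mapping log-odds to a probability."""
    return 1.0 / (1.0 + math.exp(-log_odds))


if __name__ == "__main__":
    # A rim residue adds 1.06 to the log-odds of the switching out predictor.
    shift = category_contribution("switching_out", "rim")
    print(shift, probability(shift))
```

With all other contributions held at zero, moving a residue from the core to the rim raises the switching out log-odds by 1.06, whereas in the contact conservation predictor the same change lowers the log-odds by 0.21.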
Created:
2015-12-02



