Models for protein solubilities based on molecular descriptors.
收藏Figshare2015-12-02 更新2026-04-29 收录
下载链接:
https://figshare.com/articles/dataset/_Models_for_protein_solubilities_based_on_molecular_descriptors_/669525
下载链接
链接失效反馈官方服务:
资源简介:
Results from three different linear regression models for protein solubilities, combining the protein net-charge (q) with one of the three descriptors dipole-moment (p), normalized SAP-score (nSAP), or largest SAP value (SAPmax), and from the CCSol web-server. Included are the coefficients of the linear regression models (Eq.3), the correlations between experimental and calculated solubility (), and the P-value (probability that the observed correlation is coincidental). Data are given for two protein sets: 18 proteins from EColi-K12 (setA), and 20 mutations of RNAseSA (setB).
本数据集包含三种针对蛋白质溶解度的线性回归模型(linear regression model)预测结果,此类模型以蛋白质净电荷(protein net-charge,q)为基础变量,分别结合三类描述符中的一类构建:偶极矩(dipole-moment,p)、标准化SAP评分(normalized SAP-score,nSAP)以及最大SAP值(largest SAP value,SAPmax),所有预测结果均源自CCSol服务器(CCSol web-server)。本数据集涵盖线性回归模型的系数(对应公式3,Eq.3)、实验溶解度与计算溶解度之间的相关系数(),以及P值(P-value,即观测到的相关关系纯属偶然的概率)。本次提供的数据包含两组蛋白质数据集:其一为来自大肠杆菌K12(EColi-K12)的18种蛋白质(setA),其二为核糖核酸酶SA(RNAseSA)的20种突变体(setB)。
创建时间:
2015-12-02



