Development of Reliable Aqueous Solubility Models and Their Application in Druglike Analysis
收藏NIAID Data Ecosystem2026-03-06 收录
下载链接:
https://figshare.com/articles/dataset/Development_of_Reliable_Aqueous_Solubility_Models_and_Their_Application_in_Druglike_Analysis/2994700
下载链接
链接失效反馈官方服务:
资源简介:
In this work, two reliable aqueous solubility models, ASMS (aqueous solubility based on molecular surface)
and ASMS-LOGP (aqueous solubility based on molecular surface using ClogP as a descriptor), were
constructed by using atom type classified solvent accessible surface areas and several molecular descriptors
for a diverse data set of 1708 molecules. For ASMS (without using ClogP as a descriptor), the leave-one-out q2 and root-mean-square error (RMSE) were 0.872 and 0.748 log unit, respectively. ASMS-LOGP was
slightly better than ASMS (q2 = 0.886, RMSE = 0.705). Both models were extensively validated by three
cross-validation tests and encouraging predictability was achieved. High throughput aqueous solubility
prediction was conducted for a number of data sets extracted from several widely used databases. We found
that real drugs are about 20-fold more soluble than the so-called druglike molecules in the ZINC database,
which have no violation of Lipinski's “Rule of 5” at all. Specifically, oral drugs are about 16-fold more
soluble, while injection drugs are 50−60-fold more soluble. If the criterion of a molecule to be soluble is
set to −5 log unit, about 85% of real drugs are predicted as soluble; in contrast only 50% of druglike
molecules in ZINC are soluble. We concluded that the two models could be served as a rule in druglike
analysis and an efficient filter in prioritizing compound libraries prior to high throughput screenings (HTS).
创建时间:
2016-02-28



