five

In Silico Prediction of Volume of Distribution in Humans. Extensive Data Set and the Exploration of Linear and Nonlinear Methods Coupled with Molecular Interaction Fields Descriptors

收藏
NIAID Data Ecosystem2026-03-09 收录
下载链接:
https://figshare.com/articles/dataset/In_Silico_Prediction_of_Volume_of_Distribution_in_Humans_Extensive_Data_Set_and_the_Exploration_of_Linear_and_Nonlinear_Methods_Coupled_with_Molecular_Interaction_Fields_Descriptors/3907350
下载链接
链接失效反馈
官方服务:
资源简介:
We present three in silico volume of distribution at steady state (VDss) models generated on a training set comprising 1096 compounds, which goes well beyond the conventional drug space delineated by the Rule of 5 or similar approaches. We have performed a careful selection of descriptors and kept a homogeneous Molecular Interaction Field-based descriptor set and linear (Partial Least Squares, PLS) and nonlinear (Random Forest, RF) models. We have tested the models, which we deem orthogonal in nature due to different descriptors and statistical approaches, with good results. In particular we tested the RF model, via a leave-class-out approach and by using a set of 34 additional compounds not used for training. We report comparable results against in vivo scaling approaches with geometric mean-fold error at or below 2 (for a set of 60 compounds with animal data available) and discuss the predictive performance based on the ionization states of the compounds. Lastly, we report the findings using a two-tier approach (classification followed by regression) based on VDss ranges, in an attempt to improve the prediction of compounds with very high VDss. We would recommend, overall, the RF model, with 33 descriptors, as the primary choice for VDss prediction in humans.
创建时间:
2016-10-18
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作