Developing QSAR Models with Defined Applicability Domains on PPARγ Binding Affinity Using Large Data Sets and Machine Learning Algorithms
收藏NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://figshare.com/articles/dataset/Developing_QSAR_Models_with_Defined_Applicability_Domains_on_PPAR_Binding_Affinity_Using_Large_Data_Sets_and_Machine_Learning_Algorithms/14515192
下载链接
链接失效反馈官方服务:
资源简介:
Chemicals
may cause adverse effects on human health through binding
to peroxisome proliferator-activated receptor γ (PPARγ).
Hence, binding affinity is useful for evaluating chemicals with potential
endocrine-disrupting effects. Quantitative structure–activity
relationship (QSAR) regression models with defined applicability domains
(ADs) are important to enable efficient screening of chemicals with
PPARγ binding activity. However, lack of large data sets hindered
the development of QSAR models. In this study, based on PPARγ
binding affinity data sets curated from various sources, 30 QSAR models
were developed using molecular fingerprints, two-dimensional descriptors,
and five machine learning algorithms. Structure–activity landscapes
(SALs) of the training compounds were described by network-like similarity
graphs (NSGs). Based on the NSGs, local discontinuity scores were
calculated and found to be positively correlated with the cross-validation
absolute prediction errors of the models using the different training
sets, descriptors, and algorithms. Moreover, innovative ADs were defined
based on pairwise similarities between compounds and were found to
outperform some conventional ADs. The curated data sets and developed
regression models could be useful for evaluating PPARγ-involved
adverse effects of chemicals. The SAL analysis and the innovative
ADs could facilitate understanding of prediction results from QSAR
models.
创建时间:
2021-04-29



