Denoising Drug Discovery Data for Improved Absorption, Distribution, Metabolism, Excretion, and Toxicity Property Prediction
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://figshare.com/articles/dataset/Denoising_Drug_Discovery_Data_for_Improved_Absorption_Distribution_Metabolism_Excretion_and_Toxicity_Property_Prediction/26508849
下载链接
链接失效反馈官方服务:
资源简介:
Predicting
absorption, distribution, metabolism, excretion, and
toxicity (ADMET) properties of small molecules is a key task in drug
discovery. A major challenge in building better ADMET models is the
experimental error inherent in the data. Furthermore, ADMET predictors
are typically regression tasks due to the continuous nature of the
data, which makes it difficult to apply existing denoising methods
from other domains as they largely focus on classification tasks.
Here, we develop denoising schemes based on deep learning to address
this. We find that the training error (TE) can be used to identify
the noise in regression tasks while ensemble-based and forgotten event-based
metrics fail to detect the noise. The most significant performance
increase occurs when the original model is finetuned with the denoised
data using TE as the noise detection metric. Our method has the ability
to improve models with medium noise and does not degrade the performance
of models with noise outside this range (low noise and high noise
regimes). To our knowledge, our denoising scheme is the first to improve
model performance for ADMET data and has implications for improving
models for experimental assay data in general.
创建时间:
2024-08-07



