five

Improved ADME Prediction by Multitask Pretraining on Predicted Data: Insights from the ASAP-Polaris-OpenADMET Blind Challenge

收藏
Figshare2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Improved_ADME_Prediction_by_Multitask_Pretraining_on_Predicted_Data_Insights_from_the_ASAP-Polaris-OpenADMET_Blind_Challenge/30931135
下载链接
链接失效反馈
官方服务:
资源简介:
Absorption, distribution, metabolism, and excretion (ADME) properties are among the key factors in determining the success of lead discovery and optimization campaigns. Fast and accurate prediction of molecular ADME profiles is hence of particular interest as a prioritization tool before costly experimental assays. However, the severe scarcity of publicly available training data for ADME prediction has hindered the development of improved machine learning models. Recently, industry teams have taken the important step to release the predicted labels from their in-house trained models for public domain chemical structures. In this paper, leveraging these large and diverse surrogate data sets, we propose the adoption of transfer learning using a simple multitask graph neural network (GNN) for rich representation learning and focused fine-tuning on experimental data. In participation of the blinded ASAP-Polaris-OpenADMET antiviral ADME challenge 2025, the approach achieved competitive results, ranking fourth on aggregated mean absolute error (MAE) and tied second on aggregated Pearson R. Post-competition optimization further pushed the performance to surpass the third-place entry in MAE, without using any proprietary data or commercial featurization methods. We further explored a pretraining strategy integrating both experimental and predicted labels, showing improvements and a promising direction for pretraining on data from multiple sources. The study presents an example of new opportunities for making use of predicted labels for pretraining and applications to real-world tasks. The code and pretrained models are available on: https://github.com/LongHung-Pham/pADME.
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作