five

Classification Pretraining Enhances Performance of Target-Based Affinity Prediction for Reproductive Toxicity Assessment

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://figshare.com/articles/dataset/Classification_Pretraining_Enhances_Performance_of_Target-Based_Affinity_Prediction_for_Reproductive_Toxicity_Assessment/32019591
下载链接
链接失效反馈
官方服务:
资源简介:
Reproductive toxicity is challenging to assess because of its diverse phenotypes and complex mechanisms; target-prediction models enable early identification of toxic potential. However, affinity prediction models suffer from limited training data and poor generalization to novel chemical scaffolds, which hinders their application in prospective toxicity assessment. Here, we propose a classification pretraining-regression fine-tuning framework that leverages large-scale binary activity data to learn generalizable compound-protein interaction patterns. Systematic evaluations across six dual-encoder models and three data-split strategies demonstrate consistent performance gains. Under the most stringent cluster-split scenario, our framework achieved an average R2 improvement of 0.324, demonstrating substantially enhanced generalization to novel chemotypes. Using the best-performing model (R2 = 0.797; MSE = 0.370), we predicted compound affinities across 81 reproductive targets, generating target affinity spectra that distinguished toxicity effects, identified both broad-spectrum and target-specific structural alerts, and revealed key targets for six sex-specific reproductive diseases. By predicting binding affinity, our framework not only enhances generalization but also enables interpretable, mechanism-guided reproductive toxicity assessment.
创建时间:
2026-04-15
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作