MPCD: A Multitask Graph Transformer for Molecular Property Prediction by Integrating Common and Domain Knowledge
收藏Figshare2024-12-02 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/MPCD_A_Multitask_Graph_Transformer_for_Molecular_Property_Prediction_by_Integrating_Common_and_Domain_Knowledge/27941052
下载链接
链接失效反馈官方服务:
资源简介:
Molecular property prediction with deep learning often employs self-supervised learning techniques to learn common knowledge through masked atom prediction. However, the common knowledge gained by masked atom prediction dramatically differs from the graph-level optimization objective of downstream tasks, which results in suboptimal problems. Particularly for properties with limited data, the failure to consider domain knowledge results in a direct search in an immense common space, rendering it infeasible to identify the global optimum. To address this, we propose MPCD, which enhances pretraining transferability by aligning the optimization objectives between pretraining and fine-tuning with domain knowledge. MPCD also leverages multitask learning to improve data utilization and model robustness. Technically, MPCD employs a relation-aware self-attention mechanism to capture molecules’ local and global structures comprehensively. Extensive validation demonstrates that MPCD outperforms state-of-the-art methods for absorption, distribution, metabolism, excretion, and toxicity (ADMET) and physicochemical prediction across various data sizes.
创建时间:
2024-12-02



