five

McGPCR: A Multimodal Learning Model with Improved Applicability Domain Characterization for Predicting G Protein-Coupled Receptor Affinity of Plastic Chemicals

收藏
Figshare2025-11-18 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/McGPCR_A_Multimodal_Learning_Model_with_Improved_Applicability_Domain_Characterization_for_Predicting_G_Protein-Coupled_Receptor_Affinity_of_Plastic_Chemicals/30648497
下载链接
链接失效反馈
官方服务:
资源简介:
A variety of chemicals in plastics may pose risks to human health, while only a limited number have been extensively studied for their toxicity. Binding to G protein-coupled receptors (GPCRs) serves as a crucial molecular initiating event in identifying chemicals that induce toxic effects in humans. Given the diversity of GPCRs and chemicals, the binding affinity remains largely elusive, necessitating high-throughput models with the functionality of integrating chemical and receptor features to enable predictions across multiple receptors. Herein, a human GPCR affinity data set was constructed, containing 96,776 records between 59,599 compounds and 109 GPCRs. A multimodal learning model, McGPCR, was built to predict the GPCR binding affinity of chemicals by integrating multimodal features of molecular graphs and receptor binding sites. The McGPCR outperformed models with chemical structures as the only predictor variables. Applicability domain (AD) characterization based on feature-activity landscape analysis was proposed, which ensures the reliability of predictions. The McGPCR, along with the AD, was employed to predict affinities of over 9000 plastic chemicals. By integration of the affinity, persistence, bioaccumulation, and production volume, 30 plastic chemicals with potentially high environmental risks were identified. The McGPCR with AD characterization can serve as a powerful tool for identifying toxic chemicals harmful to human health.
创建时间:
2025-11-18
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作