five

Uncovering Internal Prediction Mechanisms of Transformer-Based Chemical Foundation Models

收藏
DataCite Commons2025-11-28 更新2026-04-25 收录
下载链接:
https://dataverse.nl/citation?persistentId=doi:10.34894/SWCLWE
下载链接
链接失效反馈
官方服务:
资源简介:
Supervised deep learning has become a standard approach to deliver competitive predictive tools that allow relating the structure of molecules and their physicochemical features to properties such as binding to protein targets, performance as electronic materials, and reactivity. However, efforts to understand how these models learn and how specific predictions can be explained are still limited in chemistry, which hinders trust in these predictions. To overcome both the current widespread approach of considering supervised deep learning models as black boxes and the limitations of explainable deep learning based on feature attribution, we introduce two general methods from the field of mechanistic interpretability to molecular property prediction. Specifically, we leverage ablation and adapt existing techniques into the regression lens to inspect model predictions of the Transformer-based ChemBERTa foundation model. Our results for 3 ChemBERTa-based models finetuned on distinct datasets allow us to propose the internal mechanisms operable within each of the corresponding model layers that lead to the final predictions. Our results are a stepping stone towards more trustworthy deep learning models in the molecular domain.
提供机构:
DataverseNL
创建时间:
2025-11-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作