Uncovering Internal Prediction Mechanisms of Transformer-Based Chemical Foundation Models

Name: Uncovering Internal Prediction Mechanisms of Transformer-Based Chemical Foundation Models
Creator: DataverseNL
Published: 2025-11-28 09:17:00
License: 暂无描述

DataCite Commons2025-11-28 更新2026-04-25 收录

下载链接：

https://dataverse.nl/citation?persistentId=doi:10.34894/SWCLWE

下载链接

链接失效反馈

官方服务：

资源简介：

Supervised deep learning has become a standard approach to deliver competitive predictive tools that allow relating the structure of molecules and their physicochemical features to properties such as binding to protein targets, performance as electronic materials, and reactivity. However, efforts to understand how these models learn and how specific predictions can be explained are still limited in chemistry, which hinders trust in these predictions. To overcome both the current widespread approach of considering supervised deep learning models as black boxes and the limitations of explainable deep learning based on feature attribution, we introduce two general methods from the field of mechanistic interpretability to molecular property prediction. Specifically, we leverage ablation and adapt existing techniques into the regression lens to inspect model predictions of the Transformer-based ChemBERTa foundation model. Our results for 3 ChemBERTa-based models finetuned on distinct datasets allow us to propose the internal mechanisms operable within each of the corresponding model layers that lead to the final predictions. Our results are a stepping stone towards more trustworthy deep learning models in the molecular domain.

提供机构：

DataverseNL

创建时间：

2025-11-20

5,000+

优质数据集

54 个

任务类型

进入经典数据集