five

Automated DFT-Machine Learning Integration Enables Data-Efficient and Generalizable Feasibility Predictions in Metallaphotoredox sp2–sp3 Cross-Coupling Reactions

收藏
Figshare2026-02-10 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Automated_DFT-Machine_Learning_Integration_Enables_Data-Efficient_and_Generalizable_Feasibility_Predictions_in_Metallaphotoredox_sp_sup_2_sup_sp_sup_3_sup_Cross-Coupling_Reactions/31310236
下载链接
链接失效反馈
官方服务:
资源简介:
Nickel/photoredox catalysis in cross-coupling reactions offers mild operating conditions for efficient C–C bond formation, expanding synthetic access to pharmaceutically relevant molecules. However, routine implementation of such reactions remains constrained by the intricate reaction mechanism and limited availability of experimental data, which complicate optimization tasks and the development of predictive models. The integration of quantum-mechanical (QM) calculations with machine learning (ML) has proven to be effective for developing predictive models of complex reactions with sparse experimental data. Here, we present a combined approach that integrates automated density functional theory calculations, ML, and parallel synthesis to develop quantum mechanics-machine learning (QM-ML) models for the nickel metallophotoredox cross-coupling reaction feasibility prediction. Random-Forest classification models are trained to predict the outcome of a given reaction using DFT-computed descriptors from automatically generated 3D structures of catalytic cycle intermediates. We demonstrate the broad applicability of this approach, applying it to a diverse data set encompassing four reaction subtypes, namely, bromide cross-electrophile couplings, chloride cross-electrophile couplings, deoxygenative couplings, and amino radical transfer (ART) couplings, augmented with additional experiments curated by a systematic cheminformatics method to broaden the alkyl halides scope. We show on a blind literature data set that such a QM-ML approach can successfully predict the feasibility of complex reactions from heterogeneous data sets with minimal data requirements and can generalize it to unseen reaction subtypes with a few-shot learning approach, affording a computational model for ART coupling. Together, these capabilities provide a data-efficient solution for rapidly predicting the outcome of cross-coupling reactions and facilitate the adoption of nickel photocatalysis in the MAKE stage of the DMTA cycle.
创建时间:
2026-02-10
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作