five

Developing Deep Learning-based Large-scale Organic Reaction Classification Model via Sigma-profiles

收藏
DataCite Commons2024-06-28 更新2024-08-18 收录
下载链接:
https://figshare.com/articles/dataset/Develop_Deep_Learning-based_Large-scale_Reaction_Classification_Model_via_Sigma-profiles/24619197
下载链接
链接失效反馈
官方服务:
资源简介:
The "<b>Train_AE.zip</b>" contains the scripts for training an auto-encoder.The "<b>Train_DL_Models.zip</b>" contains the scripts for training deep learning-based models.The "<b>sigma_profiles_dict.npy</b>" contains the sigma-profiles of millions of different molecules. The SMILES of a molecule is used as key to query the corresponding sigma-profiles.The "<b>sorted_agent_dict.npy</b>" contains the statistical results of USPTO_TPL dataset concerning the frequency of occurrence of agents. The agents are shown in an descending manner.The "<b>sorted_agent_combination_dict.npy</b>" contains the statistical results of USPTO_TPL dataset concerning the frequency of occurrence of agent combinations. The combinations are shown in an descending manner.<br>The "<b>USPTO_TPL_own_version.xlsx</b>" contains the reactions that used for training/validation/testing.<br>

<b>Train_AE.zip</b> 内含自编码器(auto-encoder)的训练脚本。<b>Train_DL_Models.zip</b> 内含基于深度学习的各类模型的训练脚本。<b>sigma_profiles_dict.npy</b> 存储了数百万种不同分子的σ特征谱(sigma-profiles),以分子的SMILES作为查询键即可检索对应的σ特征谱。<b>sorted_agent_dict.npy</b> 包含USPTO_TPL数据集内试剂出现频率的统计结果,试剂按出现频次降序排列。<b>sorted_agent_combination_dict.npy</b> 包含USPTO_TPL数据集内试剂组合出现频率的统计结果,试剂组合按出现频次降序排列。<br><b>USPTO_TPL_own_version.xlsx</b> 包含用于模型训练、验证与测试的化学反应数据。
提供机构:
figshare
创建时间:
2023-11-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作