five

CHORISO - chemical reaction SMILES from academic journals

收藏
Figshare2023-12-15 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/CHORISO_-_chemical_reaction_SMILES_from_academic_journals/22598230
下载链接
链接失效反馈
官方服务:
资源简介:
CHORISO (CHemical Organic ReactIon SMILES Omnibus) is a curated dataset containing chemical reactions SMILES extracted from high-impact factor journals. It is built using the CJHIF dataset, and the resulting data is used to propose a new holistic evaluation of reaction prediction models (see paper). A detailed explanation of the processing steps and proposed metrics is included in this repo.The following files are included:choriso_public.tar.gz: compressed file containing the ChORISO dataset, 2'224'239 canonical reaction SMILES.uspto_public.tar.gz: file containing the USPTO dataset cleaned and processed following the same pipeline than CHORISO.splits.tar.gz: compressed folder containing the training, validation and test files used to train and evaluate models in the study.
创建时间:
2023-12-15
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作