ORDerly supplementary datasets

Source: Figshare. Updated: 2024-02-05. Indexed: 2026-04-08.
Download link:
https://figshare.com/articles/dataset/ORDerly_datasets/23502372/3
Description:
Supplementary datasets used in ORDerly (i.e. the non-benchmark datasets).

Condition prediction datasets: Contains parquet files for each of the four flavours of ORDerly-condition datasets that we used in the ORDerly paper.
Condition prediction datasets config: Contains the .log and .json files showing the parameters used in cleaning and the impact on dataset size after each cleaning step.
Transformer datasets: Contains plain .txt files with the six transformer-ready datasets that were used for training/testing with Molecular Transformer.
Non uspto data: Contains the datasets created with ORDerly from non-USPTO data in ORD. These datasets were used as test sets for forward prediction and retrosynthesis prediction.

Preprint: https://chemrxiv.org/engage/chemrxiv/article-details/64ca5d3e4a3f7d0c0d78ca42
NeurIPS workshop paper: https://openreview.net/forum?id=R8FQMsECIS
Code: https://github.com/sustainable-processes/orderly
The ORDerly benchmark datasets can be found here: https://figshare.com/articles/dataset/ORDerly_chemical_reactions_condition_benchmarks/23298467

Please feel free to contact me, Daniel Wigh, at dsw46@cam.ac.uk in case of any questions.
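
The transformer datasets are described only as plain text files, so the exact layout is not stated here. The following is a minimal sketch for a first look at such a file, assuming one example per line with whitespace-separated tokens (a common convention for Molecular Transformer inputs, but an assumption, not something this description confirms). The function names `read_tokenized_lines` and `detokenize` are hypothetical helpers, not part of the ORDerly code.

```python
from pathlib import Path
from typing import Iterator, List


def read_tokenized_lines(path: Path) -> Iterator[List[str]]:
    """Yield one list of whitespace-separated tokens per non-empty line.

    Assumption: each line of the file holds one example, already tokenized
    with spaces between tokens. Check a real file before relying on this.
    """
    with path.open("r", encoding="utf-8") as handle:
        for line in handle:
            tokens = line.split()
            if tokens:  # skip blank lines
                yield tokens


def detokenize(tokens: List[str]) -> str:
    """Join tokens back into one string by removing the separating spaces."""
    return "".join(tokens)
```

To use it, pass the path of one of the downloaded .txt files to `read_tokenized_lines` and iterate over the result. If the files turn out not to be space-tokenized, each line will simply come back as a single token.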
Provided by:
Felton, Kobi; Pomberger, Alexander; Lapkin, Alexei A.; Arrowsmith, Joe; Wigh, Daniel
Created:
2024-02-05