MolClassifier Training and Validation Datasets
Source: NIAID Data Ecosystem. Indexed: 2026-05-02.
Download link:
https://zenodo.org/record/10978563
Description:
The dataset contains 18,626 chemical images (15,720 for training and 2,906 for validation) annotated with three classes: `Molecular Structure`, `Markush Structure`, and `Background`.
The images were randomly selected from the outputs of a segmentation module applied to documents from the United States Patent and Trademark Office.
This dataset is part of PatCID: an open-access dataset of chemical structures in patent documents.
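As a quick sanity check on the figures in the description, the following minimal Python sketch records the three class names and the two split sizes and computes each split's share of the total. The names `CLASSES`, `SPLIT_SIZES`, `CLASS_TO_ID`, and `split_fractions` are illustrative assumptions, as is the integer label encoding; none of them come from the dataset itself, and the archive's actual file layout is not described here.

```python
# Class names and split sizes exactly as stated in the dataset description.
CLASSES = ("Molecular Structure", "Markush Structure", "Background")
SPLIT_SIZES = {"training": 15720, "validation": 2906}

# Hypothetical integer encoding for the labels (an assumption for
# illustration; the dataset may encode its classes differently).
CLASS_TO_ID = {name: idx for idx, name in enumerate(CLASSES)}


def split_fractions(sizes):
    """Return each split's share of the total number of images."""
    total = sum(sizes.values())
    return {name: count / total for name, count in sizes.items()}


if __name__ == "__main__":
    total = sum(SPLIT_SIZES.values())
    print(f"total images: {total}")
    for name, fraction in split_fractions(SPLIT_SIZES).items():
        print(f"{name}: {SPLIT_SIZES[name]} images ({fraction:.1%})")
```

The split sizes sum to the stated total of 18,626 images, which corresponds to roughly an 84/16 division between training and validation.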
Created:
2024-06-18



