altaidevorg/PubChem-SMILES-SELFIES-InChI-IUPAC-v2
Source: Hugging Face. Updated 2026-01-22; indexed 2026-02-07.
Download link:
https://hf-mirror.com/datasets/altaidevorg/PubChem-SMILES-SELFIES-InChI-IUPAC-v2
Description:
This dataset contains 249,642 training samples of chemical molecules. Each sample gives several representations of one molecule (CID, SMILES, SMILES_Canonical, SELFIES, inchi, iupac, and formula), together with image data (image_1, image_2, image_3) and the image source (image_source). The total dataset size is 12,138,024,966 bytes (about 12.1 GB).
Provider:
altaidevorg
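
The figures in the description can be turned into a quick size estimate, and the listed columns into a small streaming peek. The following is a minimal sketch, not the provider's own loading code. It assumes the Hugging Face `datasets` package is installed, that the split is named "train" (inferred from "training samples"), and that the column names match the description exactly. The constant names, `size_summary`, and `peek` are hypothetical helpers introduced here for illustration.

```python
# Values copied from the listing; the column dtypes are not stated there.
REPO_ID = "altaidevorg/PubChem-SMILES-SELFIES-InChI-IUPAC-v2"
REPRESENTATION_COLUMNS = [
    "CID", "SMILES", "SMILES_Canonical", "SELFIES", "inchi", "iupac", "formula",
]
IMAGE_COLUMNS = ["image_1", "image_2", "image_3"]
NUM_SAMPLES = 249_642
TOTAL_BYTES = 12_138_024_966


def size_summary(total_bytes=TOTAL_BYTES, num_samples=NUM_SAMPLES):
    """Convert the listed byte count into more readable units."""
    return {
        "gigabytes": total_bytes / 1e9,       # decimal GB
        "gibibytes": total_bytes / 2**30,     # binary GiB
        "avg_bytes_per_sample": total_bytes / num_samples,
    }


def peek(n=3):
    """Stream the first n records and keep only the non-image columns.

    Assumption: the split is called "train". Requires network access.
    """
    from datasets import load_dataset  # imported lazily: pip install datasets

    stream = load_dataset(REPO_ID, split="train", streaming=True)
    return [
        {name: row[name] for name in REPRESENTATION_COLUMNS}
        for row in stream.take(n)
    ]


if __name__ == "__main__":
    for key, value in size_summary().items():
        print(f"{key}: {value:,.2f}")
```

By this arithmetic the dataset is roughly 12.14 GB (11.30 GiB), or about 48.6 kB per sample on average, most of which is presumably the three images. Streaming is used in `peek` so that inspecting a few records does not require downloading the full dataset first.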



