suku9/enamine_smiles_twothird_sample
Source: Hugging Face. Updated: 2025-02-25. Indexed: 2025-08-30.
Download link:
https://hf-mirror.com/datasets/suku9/enamine_smiles_twothird_sample
Description:
The dataset contains a single string feature named smiles. It is split into training, validation, and test sets containing approximately 45 billion, 643 million, and 1.287 billion examples, respectively. The total size of the dataset is about 27.6 GB.
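To make the split sizes concrete, the following is a minimal sketch that computes each split's share from the approximate counts stated in the description and, optionally, streams a few rows. It assumes the Hugging Face datasets package is installed, that the split names are train, validation, and test, and that the repository can be streamed; the helper names split_fractions and peek_smiles are hypothetical and not part of the dataset.

```python
from itertools import islice

# Approximate split sizes as stated in the description (not verified).
SPLIT_COUNTS = {
    "train": 45_000_000_000,
    "validation": 643_000_000,
    "test": 1_287_000_000,
}


def split_fractions(counts):
    """Return each split's share of the total example count."""
    total = sum(counts.values())
    return {name: n / total for name, n in counts.items()}


def peek_smiles(split="validation", n=5):
    """Stream the first n SMILES strings of a split without a full download.

    Assumption: the split names match the keys of SPLIT_COUNTS and the
    column is called "smiles", as the description suggests.
    """
    # Imported lazily so the arithmetic above works without the package.
    from datasets import load_dataset

    stream = load_dataset(
        "suku9/enamine_smiles_twothird_sample", split=split, streaming=True
    )
    return [row["smiles"] for row in islice(stream, n)]


if __name__ == "__main__":
    for name, share in split_fractions(SPLIT_COUNTS).items():
        print(f"{name}: {share:.2%}")
```

Streaming is used here because a dataset of this size is impractical to download just to inspect a handful of rows; calling peek_smiles() requires network access.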
Provider:
suku9



