yangzhang33/pcqm4mv2_sequence_cyc_0_0_w_b
收藏Hugging Face2025-07-08 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/yangzhang33/pcqm4mv2_sequence_cyc_0_0_w_b
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含SMILES和sequence两个字符串字段的数据集,分为训练集、验证集和测试集,共计约367万条记录。数据集的总大小约为3.55GB,下载大小约为561MB。提供了默认配置,指定了各个数据集分片的文件路径。
This dataset includes two string fields, SMILES and sequence, and is divided into training, validation, and test sets, with a total of approximately 3.67 million records. The total size of the dataset is about 3.55GB, and the download size is about 561MB. A default configuration is provided, specifying the file paths for each dataset split.
提供机构:
yangzhang33



