jablonkagroup/chempile-mix-100m
Source: Hugging Face. Updated: 2025-07-29. Indexed: 2025-08-09.
Download link:
https://hf-mirror.com/datasets/jablonkagroup/chempile-mix-100m
Description:
This is a text dataset divided into training, validation, and test splits. The training split contains 245,822 examples (335,481,125 bytes), the validation split contains 11,956 examples (16,269,108 bytes), and the test split contains 12,483 examples (17,237,898 bytes). The reported total size of the dataset is 368,988,132 bytes.
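The split figures above can be turned into proportions and average example sizes with a few lines of arithmetic. The following is a minimal sketch that only uses the numbers quoted in the description; the names `splits`, `REPORTED_TOTAL_BYTES`, and `split_summary` are illustrative and not part of the dataset or any library.

```python
# Split statistics exactly as quoted in the description (not fetched from the Hub).
splits = {
    "train": {"examples": 245_822, "bytes": 335_481_125},
    "validation": {"examples": 11_956, "bytes": 16_269_108},
    "test": {"examples": 12_483, "bytes": 17_237_898},
}
REPORTED_TOTAL_BYTES = 368_988_132  # total size as stated in the listing

total_examples = sum(s["examples"] for s in splits.values())
total_bytes = sum(s["bytes"] for s in splits.values())


def split_summary(name):
    """Return the share of examples and the mean bytes per example for one split."""
    s = splits[name]
    return {
        "fraction": s["examples"] / total_examples,
        "avg_bytes": s["bytes"] / s["examples"],
    }


for name in splits:
    info = split_summary(name)
    print(f"{name}: {info['fraction']:.2%} of examples, "
          f"~{info['avg_bytes']:.0f} bytes per example")

# Compare the per-split sum with the reported total; the two may differ slightly.
print(f"sum of split sizes: {total_bytes:,} bytes "
      f"(reported total: {REPORTED_TOTAL_BYTES:,})")
```

Hard-coding the figures keeps the sketch runnable offline; the same summary could be computed from the dataset's own metadata after downloading it.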
Provider:
jablonkagroup



