jiyeonkim/OLMo_C4_data_1k
收藏Hugging Face2024-08-20 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/jiyeonkim/OLMo_C4_data_1k
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含一个名为train的分割,共有1,238,634个示例,总大小为5,078,399,400字节。数据集的特征包括一个名为input_ids的序列,数据类型为int32。数据集的下载大小为2,508,956,865字节。
The dataset includes a split named train with 1,238,634 examples and a total size of 5,078,399,400 bytes. The features of the dataset include a sequence named input_ids with a data type of int32. The download size of the dataset is 2,508,956,865 bytes.
提供机构:
jiyeonkim



