hendrydong/fineweb-edu-1BT
收藏Hugging Face2025-11-07 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/hendrydong/fineweb-edu-1BT
下载链接
链接失效反馈官方服务:
资源简介:
这个数据集包含三个数据分割:验证集、测试集和训练集。每个分割都包含了大量的文本数据示例,其中验证集有967个示例,测试集有968个示例,而训练集则包含了967016个示例。数据集的总大小约为4.61GB,下载大小约为2.75GB。数据集的唯一特征是文本内容。
The dataset consists of three splits: validation set, test set, and training set. Each split contains a large number of text data examples, with the validation set having 967 examples, the test set having 968 examples, and the training set containing 967,016 examples. The total size of the dataset is approximately 4.61GB, and the download size is about 2.75GB. The only feature of the dataset is the text content.
提供机构:
hendrydong



