Nithish2410/fineweb_edu_3_0_600
收藏Hugging Face2025-11-07 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/Nithish2410/fineweb_edu_3_0_600
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含文本、评分和文档长度的数据集,用于训练模型。数据集包含一个训练集,共有1000万个示例,大小为约20GB。数据集的下载大小约为12GB。
This dataset includes text, score, and document length, which is used for model training. The dataset contains a training split with 10 million examples, totaling about 20GB in size. The download size of the dataset is about 12GB.
提供机构:
Nithish2410



