sumuks/Ultra-FineWeb-10B
Hugging Face | Updated 2025-06-14 | Indexed 2025-10-25
Download link:
https://hf-mirror.com/datasets/sumuks/Ultra-FineWeb-10B
Description:
The dataset has three fields: content (text), score (floating-point number), and source (text). It consists of a single training split of 10,143,687 examples totaling 40,417,147,575 bytes. The download size is 23,409,321,168 bytes.
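To put the figures in the description into more familiar units, the sketch below derives the average example size, the sizes in GiB, and the download-to-dataset size ratio, and checks a record against the three described fields. The helper names (size_summary, is_valid_record, stream_train) are hypothetical and only illustrative. The streaming helper assumes the Hugging Face datasets library is installed and that the repository ID matches the one in the download link; it is not called by default because it needs network access.

```python
from typing import Any, Dict, Iterator

# Figures copied from the description above.
NUM_EXAMPLES = 10_143_687
DATASET_BYTES = 40_417_147_575
DOWNLOAD_BYTES = 23_409_321_168


def size_summary() -> Dict[str, float]:
    """Derive a few convenience figures from the stated sizes."""
    return {
        "avg_bytes_per_example": DATASET_BYTES / NUM_EXAMPLES,
        "dataset_gib": DATASET_BYTES / 2**30,
        "download_gib": DOWNLOAD_BYTES / 2**30,
        "download_ratio": DOWNLOAD_BYTES / DATASET_BYTES,
    }


def is_valid_record(record: Dict[str, Any]) -> bool:
    """Check that a record has the three described fields with the described types."""
    return (
        isinstance(record.get("content"), str)
        and isinstance(record.get("score"), float)
        and isinstance(record.get("source"), str)
    )


def stream_train(repo_id: str = "sumuks/Ultra-FineWeb-10B") -> Iterator[Dict[str, Any]]:
    """Stream the training split without downloading it in full.

    Assumption: the `datasets` library is installed and the repository is
    reachable. The import is deferred so the rest of this sketch runs offline.
    """
    from datasets import load_dataset

    return iter(load_dataset(repo_id, split="train", streaming=True))


if __name__ == "__main__":
    for name, value in size_summary().items():
        print(f"{name}: {value:.2f}")
```

By these figures an example averages roughly 4 KB, and the download is a little under 60% of the stated dataset size. Streaming avoids fetching the full download up front when only a sample of records is needed.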
Provider:
sumuks



