ChavyvAkvar/Ultra-Fineweb-1M-Sample-62
Source: Hugging Face. Updated 2025-09-26; indexed 2025-10-25.
Download link:
https://hf-mirror.com/datasets/ChavyvAkvar/Ultra-Fineweb-1M-Sample-62
Summary:
This dataset has three fields: content (string), the text itself; score (float), a numerical score; and source (string), the origin of the data. It contains a single training split of 1,000,000 examples with a total size of 3.98 GB. A default configuration specifies the file path for the training data.
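The schema above can be sketched in Python as follows. This is a minimal sketch, not an official loader: it assumes the Hugging Face `datasets` library is installed and that the dataset is reachable under the repository ID shown in the title. The helper names `matches_schema` and `stream_train_split` are hypothetical and introduced only for illustration.

```python
from typing import Any, Dict, Iterable

# Repository ID taken from the listing title.
DATASET_ID = "ChavyvAkvar/Ultra-Fineweb-1M-Sample-62"

# Field names and types as stated in the summary.
EXPECTED_SCHEMA = {"content": str, "score": float, "source": str}


def matches_schema(record: Dict[str, Any]) -> bool:
    """Return True if the record has exactly the three described fields,
    each with the described Python type."""
    if set(record) != set(EXPECTED_SCHEMA):
        return False
    return all(
        isinstance(record[name], expected_type)
        for name, expected_type in EXPECTED_SCHEMA.items()
    )


def stream_train_split() -> Iterable[Dict[str, Any]]:
    """Stream the train split so the full 3.98 GB is not downloaded up front.

    Assumption: the `datasets` library is installed and the repository is
    publicly accessible. The import is done lazily so that `matches_schema`
    can be used without the library.
    """
    from datasets import load_dataset

    return load_dataset(DATASET_ID, split="train", streaming=True)
```

To inspect the data, one could iterate over `stream_train_split()`, check the first record with `matches_schema`, and print its `source` and `score` fields. The summary does not say what the score measures, so any threshold applied to it would be a judgment call.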
Provider:
ChavyvAkvar



