atlasia/Fineweb2-preds
收藏Hugging Face2024-12-15 更新2024-12-21 收录
下载链接:
https://hf-mirror.com/datasets/atlasia/Fineweb2-preds
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含文本和预测两个主要特征,其中预测特征进一步细分为预测置信度和预测标签。数据集分为训练集和测试集,训练集包含69,181,074个示例,测试集包含404,456个示例。总下载大小为9,121,800,508字节,数据集总大小为18,211,162,825字节。
The dataset includes two main features: text and prediction, with the prediction feature further divided into prediction confidence and prediction label. The dataset is split into a training set and a test set, with the training set containing 69,181,074 examples and the test set containing 404,456 examples. The total download size is 9,121,800,508 bytes, and the total dataset size is 18,211,162,825 bytes.
提供机构:
atlasia



