b3x0m/data
收藏Hugging Face2024-11-21 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/b3x0m/data
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含用于训练和测试的文本数据,特征包括输入ID(input_ids)、注意力掩码(attention_mask)和标签(labels)。训练集包含2,033,481个样本,占用54,163,799,916字节;测试集包含225,943个样本,占用6,018,217,748字节。数据集总大小为60,182,017,664字节,下载大小为293,233,853字节。
This dataset contains text data for training and testing, with features including input IDs (input_ids), attention masks (attention_mask), and labels (labels). The training set consists of 2,033,481 samples, occupying 54,163,799,916 bytes; the test set consists of 225,943 samples, occupying 6,018,217,748 bytes. The total dataset size is 60,182,017,664 bytes, with a download size of 293,233,853 bytes.
提供机构:
b3x0m



