TokenSwift/qwen2.5_pg19_train_data
收藏Hugging Face2025-02-17 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/TokenSwift/qwen2.5_pg19_train_data
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含整数序列作为特征,具体为input_ids字段,该字段是int32类型。数据集分为训练集(train),共有27352个示例,总大小为1,083,419,904字节。数据集的下载大小为487,720,508字节。目前数据集只有一个配置(default),其中指定了训练集文件的路径。
The dataset includes integer sequences as features, specifically the input_ids field, which is of type int32. The dataset is split into a training set (train) with a total of 27,352 examples and a total size of 1,083,419,904 bytes. The download size of the dataset is 487,720,508 bytes. Currently, there is only one configuration (default), which specifies the path to the training set files.
提供机构:
TokenSwift



