TokenSwift/llama3.1_pg19_train_data
收藏Hugging Face2025-02-17 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/TokenSwift/llama3.1_pg19_train_data
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含int32类型序列特征(input_ids)的数据集,包含一个训练集(train split),共有27332个样本,数据集大小为1,076,886,272字节。数据集的下载大小为487,651,276字节。数据集配置为默认配置(default config),训练数据文件路径为data/train-*。
This dataset includes a sequence feature of type int32 named input_ids, with one training set (train split) containing 27,332 samples, and the total dataset size is 1,076,886,272 bytes. The download size of the dataset is 487,651,276 bytes. The dataset is configured with the default setting (default config), and the training data file path is data/train-*.
提供机构:
TokenSwift



