YnezT/gpt2-directions-random_walk_fix_len
Source: Hugging Face. Updated 2024-08-29; indexed 2024-12-14.
Download link:
https://hf-mirror.com/datasets/YnezT/gpt2-directions-random_walk_fix_len
Resource overview:
The dataset has three features: input_ids (input ID sequences), attention_mask (attention mask sequences), and labels (label sequences). It is split into a training set of 20 million samples and a test set of 200,000 samples. The download size is 3,858,886,435 bytes (about 3.86 GB) and the total size is 34,643,000,000 bytes (about 34.64 GB). The data files are stored under data/train-* and data/test-*.
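The figures above can be sanity-checked with a little arithmetic before committing to the full download. The sketch below is illustrative only: the constant and function names are made up for this example, and `stream_split` assumes that the Hugging Face `datasets` package is installed, that the repository is reachable, and that the splits are named `train` and `test` (inferred from the data/train-* and data/test-* paths, not confirmed by the listing).

```python
# Figures copied from the listing above.
TRAIN_SAMPLES = 20_000_000
TEST_SAMPLES = 200_000
DOWNLOAD_BYTES = 3_858_886_435
TOTAL_BYTES = 34_643_000_000


def bytes_per_sample(total_bytes: int, n_samples: int) -> float:
    """Average size of one sample in bytes."""
    return total_bytes / n_samples


def stream_split(split: str = "test"):
    """Return a streaming view of one split without downloading everything.

    Assumption: split names are "train" and "test". The import is done
    lazily so the arithmetic below runs even without `datasets` installed.
    """
    from datasets import load_dataset

    return load_dataset(
        "YnezT/gpt2-directions-random_walk_fix_len",
        split=split,
        streaming=True,
    )


total_samples = TRAIN_SAMPLES + TEST_SAMPLES
avg_bytes = bytes_per_sample(TOTAL_BYTES, total_samples)
ratio = TOTAL_BYTES / DOWNLOAD_BYTES

print(f"{total_samples:,} samples, about {avg_bytes:.0f} bytes each")
print(f"listed total size is about {ratio:.1f}x the download size")
```

To peek at a record, something like `next(iter(stream_split("test")))` should return a dict with the keys input_ids, attention_mask, and labels, provided the assumptions above hold. Streaming is used here because the listed total size is large enough that inspecting a few records first is usually the cheaper option.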
Provider:
YnezT



