CHINAYWX/lang_complex_max_dependency_length_3_tokenizered
收藏Hugging Face2024-09-08 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/CHINAYWX/lang_complex_max_dependency_length_3_tokenizered
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个字段:input_ids(int32类型)、attention_mask(int8类型)和labels(int64类型)。数据集分为训练集和测试集,训练集有100万个样本,测试集有5000个样本。数据集的总大小为738,418,152字节,下载大小为257,713,097字节。
The dataset contains three fields: input_ids (int32 type), attention_mask (int8 type), and labels (int64 type). It is divided into a training set and a test set, with 1,000,000 samples in the training set and 5,000 samples in the test set. The total size of the dataset is 738,418,152 bytes, and the download size is 257,713,097 bytes.
提供机构:
CHINAYWX



