esenergun/wikitext-pos
收藏Hugging Face2024-10-17 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/esenergun/wikitext-pos
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个主要特征:word_lines(字符串类型)、label_lines(int64类型的序列)和merge_error(布尔类型)。数据集分为训练集、验证集和测试集,分别包含1,801,350、3,760和4,358个样本。数据集的下载大小为395,896,629字节,总大小为1,368,497,386字节。数据文件分别存储在train-*、validation-*和test-*路径下。
The dataset contains three main features: word_lines (string type), label_lines (sequence of int64), and merge_error (boolean type). The dataset is divided into three parts: training set (train), validation set (validation), and test set (test), containing 1,801,350, 3,760, and 4,358 samples respectively. The download size of the dataset is 395,896,629 bytes, and the total size is 1,368,497,386 bytes. The data files are stored in the paths train-*, validation-*, and test-*.
提供机构:
esenergun



