Yujivus/wmt14-de-en-helsinki-filtered-sorted-40
收藏Hugging Face2025-09-28 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/Yujivus/wmt14-de-en-helsinki-filtered-sorted-40
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了输入ID序列、注意力掩码、标签、输入长度和标签长度等特征。数据集被划分为训练集、验证集和测试集,其中训练集包含2036097个示例,验证集包含1744个示例,测试集包含1540个示例。数据集的总大小为683154757字节。
The dataset includes features such as input ID sequences, attention masks, labels, input lengths, and label lengths. It is divided into training, validation, and test sets, containing 2036097, 1744, and 1540 examples respectively. The total size of the dataset is 683154757 bytes.
提供机构:
Yujivus



