kevindf/multi30k_plus90K_fr
收藏Hugging Face2024-07-18 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/kevindf/multi30k_plus90K_fr
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含法语(fr)和英语(en)两种语言的文本数据,每个样本包含一对法语和英语的句子。数据集分为训练集、测试集和验证集,分别包含119,000、4,071和4,014个样本。训练集的大小为8,934,364字节,测试集为323,757字节,验证集为304,876字节。总下载大小为5,048,428字节,数据集总大小为9,562,997字节。
This dataset contains text data in French (fr) and English (en), with each sample consisting of a pair of sentences in French and English. The dataset is divided into training, test, and validation sets, containing 119,000, 4,071, and 4,014 samples respectively. The training set size is 8,934,364 bytes, the test set is 323,757 bytes, and the validation set is 304,876 bytes. The total download size is 5,048,428 bytes, and the total dataset size is 9,562,997 bytes.
提供机构:
kevindf



