Hoshikuzu/opus-100-en-ja
Source: Hugging Face. Updated: 2024-08-03. Indexed: 2024-12-14.
Download link:
https://hf-mirror.com/datasets/Hoshikuzu/opus-100-en-ja
Description:
This corpus is extracted from Helsinki-NLP/opus-100 and contains Japanese-English translation pairs. It is split into training, validation (development), and test sets with 1,000,000, 2,000, and 2,000 examples respectively. Each example has a translation feature: a dictionary holding the text in both languages. The download size is 64,068,812 bytes and the total dataset size is 88,730,971 bytes. The dataset configuration specifies the paths of the data files. The README also provides example code showing how to use the dataset, and cites the related academic paper and the OPUS project literature.
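The README's own example code is not reproduced on this page, so the following is only a minimal sketch of how the dataset might be loaded with the Hugging Face `datasets` library. The repository name comes from this page. The split names (`train`, `validation`, `test`), the `en`/`ja` keys inside the `translation` dictionary, and the helper names `to_pair`, `load_splits`, and `size_mismatches` are assumptions made for illustration.

```python
from typing import Dict, Mapping, Sized, Tuple

DATASET_ID = "Hoshikuzu/opus-100-en-ja"  # repository name taken from this page

# Split sizes as stated in the description; the split NAMES are an assumption.
EXPECTED_SIZES = {"train": 1_000_000, "validation": 2_000, "test": 2_000}


def to_pair(record: Dict, src: str = "en", tgt: str = "ja") -> Tuple[str, str]:
    """Return (source_text, target_text) from one record.

    Assumes a record shaped like {"translation": {"en": ..., "ja": ...}};
    the "en"/"ja" key names are an assumption, not confirmed by this page.
    """
    translation = record["translation"]
    return translation[src], translation[tgt]


def size_mismatches(splits: Mapping[str, Sized]) -> Dict[str, int]:
    """Return {split_name: actual_size} for splits whose size differs from
    the documented one. Missing splits are reported with size -1."""
    mismatches = {}
    for name, expected in EXPECTED_SIZES.items():
        actual = len(splits[name]) if name in splits else -1
        if actual != expected:
            mismatches[name] = actual
    return mismatches


def load_splits():
    """Download all splits (needs network access and `pip install datasets`)."""
    from datasets import load_dataset  # imported lazily so the helpers work offline

    return load_dataset(DATASET_ID)
```

A typical session would call `splits = load_splits()`, check `size_mismatches(splits)` for an empty result, and then inspect a pair with `to_pair(splits["train"][0])`. If the key or split names differ from the assumptions above, a `KeyError` or a non-empty mismatch report would show it.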
Provider:
Hoshikuzu



