COMPRISE_Data10_MENYO-20K_V1.0
Source: NIAID Data Ecosystem (indexed 2026-03-13)
Download link:
https://zenodo.org/record/5997744
Description:
A multi-domain parallel corpus for the English-Yoruba language pair, intended for benchmarking machine translation systems. The dataset contains 20,100 parallel sentences, split into 10,070 train, 3,397 dev, and 6,633 test sentences.
Created:
2022-02-11



