MorisienMT
收藏arXiv2022-06-06 更新2024-06-21 收录
下载链接:
https://huggingface.co/datasets/prajdabre/MorisienMT
下载链接
链接失效反馈官方服务:
资源简介:
MorisienMT是一个用于评估毛里求斯克里奥尔语机器翻译质量的数据集。该数据集由日本信息通信研究机构和毛里求斯大学的研究者创建,包含英毛、法毛和毛里求斯克里奥尔语的单语语料。数据集内容主要来源于书籍翻译,特别是圣经,以及手工创建的基本句子和表达。该数据集旨在解决资源贫乏语言的机器翻译问题,特别是在毛里求斯广泛使用的毛里求斯克里奥尔语(Morisien)。
MorisienMT is a dataset designed for evaluating the machine translation quality of Mauritian Creole. It was developed by researchers from the National Institute of Information and Communications Technology and the University of Mauritius. The dataset includes parallel corpora of English and Mauritian Creole, parallel corpora of French and Mauritian Creole, as well as monolingual corpora of Mauritian Creole itself. The content of the dataset is mainly sourced from book translations, particularly the Bible, and manually created basic sentences and expressions. This dataset aims to address the machine translation challenges of low-resource languages, especially Mauritian Creole (Morisien), which is widely used in Mauritius.
提供机构:
日本信息通信研究机构
创建时间:
2022-06-06



