Parallel Bible Corpus
收藏arXiv2025-09-30 收录
下载链接:
https://pypi.org/project/sacrebleu/
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是从平行圣经语料库中筛选出来的一部分,专注于36种使用变音符号的非洲语言。它为涉及机器翻译和变音符号标注的实验提供了正确且一致的变音符号标注数据。该数据集覆盖了36种语言,主要针对机器翻译和变音符号标注的任务。
This dataset is a curated subset extracted from the parallel Bible corpus, focusing on 36 African languages that employ diacritics. It provides correctly and consistently annotated diacritic data for experiments related to machine translation and diacritic tagging. This dataset covers 36 languages and is primarily targeted at machine translation and diacritic tagging tasks.
提供机构:
Parallel Bible Corpus



