mteb/NusaTranslationBitextMining
收藏Hugging Face2025-05-07 更新2025-05-31 收录
下载链接:
https://hf-mirror.com/datasets/mteb/NusaTranslationBitextMining
下载链接
链接失效反馈官方服务:
资源简介:
NusaTranslation是一个平行数据集,用于印度尼西亚11种语言和英语之间的机器翻译。数据集包含多个语言配置,如abs、bbc、bew等,并且有详细的统计信息。数据集是通过人工标注的,遵循CC-BY-SA-4.0许可证。数据集可以用于多语言任务,并且提供了如何评估模型的信息。
NusaTranslation is a parallel dataset for machine translation on 11 Indonesia languages and English. The dataset includes multiple language configurations such as abs, bbc, bew, etc., and provides detailed statistics. The dataset is human-annotated and licensed under CC-BY-SA-4.0. It can be used for multilingual tasks and provides information on how to evaluate models.
提供机构:
mteb



