five

FLoRes-101 Dataset

收藏
paperswithcode.com2025-03-22 收录
下载链接:
https://paperswithcode.com/dataset/flores-101
下载链接
链接失效反馈
官方服务:
资源简介:
FLoRes-101 is an evaluation benchmark for low-resource and multilingual machine translation. It consists of 3001 sentences extracted from English Wikipedia, covering a variety of different topics and domains. These sentences have been translated into 101 languages by professional translators through a carefully controlled process. The FLoRes-101 dataset was introduced to address the lack of good evaluation benchmarks for low-resource languages. It enables better assessment of model quality in these languages and allows for the evaluation of many-to-many multilingual translation systems, as all translations are multilingually aligned.

FLoRes-101是一项针对低资源多语言机器翻译的评估基准。该数据集由3001个句子组成,这些句子源自英文维基百科,涵盖众多不同的主题和领域。这些句子经过专业翻译人员的精心翻译,涉及101种语言。FLoRes-101数据集的提出旨在填补低资源语言领域内优秀评估基准的不足,它能够更有效地评估模型在这些语言中的质量,并允许对多对多的多语言翻译系统进行评估,因为所有翻译都实现了多语言对齐。
提供机构:
Papers with Code
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作