MLQE-PE Dataset
Source: paperswithcode.com (indexed 2025-03-22)
Download link:
https://paperswithcode.com/dataset/mlqe-pe
Description:
The Multilingual Quality Estimation and Automatic Post-editing (MLQE-PE) Dataset supports Machine Translation (MT) Quality Estimation (QE) and Automatic Post-Editing (APE). It contains seven language pairs, with human labels for 9,000 translations per language pair in the following formats: sentence-level direct assessments and post-editing effort, and word-level good/bad labels. It also contains the post-edited sentences, the titles of the articles from which the sentences were extracted, and the neural MT models used to translate the text.
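As a rough illustration of how the label types described above might sit together in one record, here is a minimal Python sketch. The class name `QERecord`, its field names, the `"GOOD"`/`"BAD"` tag strings, and the helper `bad_word_ratio` are all hypothetical and chosen for illustration only; they do not reflect the dataset's actual file layout or column names.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class QERecord:
    """Hypothetical container for one labelled translation (not the real schema)."""
    source: str               # original sentence
    translation: str          # neural MT output
    post_edit: str            # human post-edited translation
    direct_assessment: float  # sentence-level direct assessment score
    word_tags: List[str]      # one assumed "GOOD"/"BAD" tag per translated word


def bad_word_ratio(record: QERecord) -> float:
    """Return the fraction of word-level tags marked "BAD" (0.0 if there are no tags)."""
    if not record.word_tags:
        return 0.0
    bad = sum(1 for tag in record.word_tags if tag == "BAD")
    return bad / len(record.word_tags)


# Made-up example values, for illustration only.
example = QERecord(
    source="a b c d",
    translation="w x y z",
    post_edit="w x q z",
    direct_assessment=0.0,
    word_tags=["GOOD", "GOOD", "BAD", "GOOD"],
)
print(bad_word_ratio(example))
```

The ratio computed here is only a simple summary of the word-level tags; it is not the dataset's own post-editing effort measure.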
Provided by:
Papers with Code



