MultiSynt/nemotron-cc-spanish-opus-qe | Machine translation dataset | Translation quality estimation dataset
Source: hugging_face. Updated: 2025-10-09. Indexed: 2025-10-25.
Download link:
https://hf-mirror.com/datasets/MultiSynt/nemotron-cc-spanish-opus-qe
Description:
This dataset contains quality estimation (QE) scores for text translated into Spanish with Opus and scored with Unbabel/wmt22-cometkiwi-da. The dataset is in plain text format: each line holds the QE score for the corresponding line of the translated Spanish text. Document boundaries are delimited by END_OF_DOCUMENT markers, which appear as FIN DE DOCUMENTO in the Spanish text. The scores are sentence-level; document-level scores, if required, must be obtained by aggregating them.
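The aggregation step mentioned above could be sketched as follows. This is only an illustration under stated assumptions, not the dataset authors' procedure: it assumes the Spanish text and the score file are read line by line in parallel, that a boundary line consists of exactly FIN DE DOCUMENTO after trimming whitespace, that the score on a boundary line can be discarded, and that the mean is an acceptable aggregate. The function name `document_scores` is hypothetical.

```python
from statistics import fmean
from typing import Iterable, List

# Assumption: the boundary line in the Spanish text is exactly this string.
DOC_MARKER = "FIN DE DOCUMENTO"


def document_scores(
    text_lines: Iterable[str],
    score_lines: Iterable[str],
    marker: str = DOC_MARKER,
) -> List[float]:
    """Average sentence-level QE scores per document (hypothetical helper).

    text_lines and score_lines are walked in parallel; zip stops at the
    shorter input, so a length mismatch is silently truncated.
    """
    doc_means: List[float] = []
    current: List[float] = []
    for text, score in zip(text_lines, score_lines):
        if text.strip() == marker:
            # Boundary reached: close the current document and skip
            # the score attached to the marker line itself.
            if current:
                doc_means.append(fmean(current))
            current = []
            continue
        current.append(float(score))
    # Keep a trailing document that has no closing marker.
    if current:
        doc_means.append(fmean(current))
    return doc_means
```

With real data, the two arguments would be open file handles for the Spanish text and the score file. Swapping `fmean` for `min` or a median would give a more conservative document-level score; which aggregate is appropriate is left open by the description.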
Provided by:
MultiSynt



