BLEU
arXiv | Indexed 2025-09-30
Download link:
https://en.wikipedia.org/wiki/BLEU
Resource description:
This dataset is intended for evaluating the quality of generated text, using the BLEU score as the metric. BLEU (Bilingual Evaluation Understudy) was introduced to evaluate machine translation quality and has since been widely applied to other tasks, including summarization and question answering. The score is computed by comparing the n-grams of a candidate sentence against those of one or more reference sentences. The goal of the task is to evaluate generated text effectively.
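The n-gram comparison described above can be sketched in a few lines of Python. This is a minimal illustration, not the scoring code used by this resource: it assumes whitespace-tokenized input, uniform weights over 1- to 4-grams, no smoothing, and the standard brevity penalty. The function names `ngrams` and `sentence_bleu` are hypothetical and chosen only for this example.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(candidate, references, max_n=4):
    """Unsmoothed sentence-level BLEU with uniform weights over 1..max_n."""
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngrams(candidate, n)
        total = sum(cand_counts.values())
        # Clip each candidate n-gram count by its maximum count in any reference.
        max_ref = Counter()
        for ref in references:
            for gram, count in ngrams(ref, n).items():
                max_ref[gram] = max(max_ref[gram], count)
        clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
        if total == 0 or clipped == 0:
            return 0.0  # no smoothing: any zero precision gives a zero score
        log_precisions.append(math.log(clipped / total))

    # Brevity penalty uses the reference length closest to the candidate length
    # (ties broken in favor of the shorter reference).
    c = len(candidate)
    r = min((len(ref) for ref in references), key=lambda length: (abs(length - c), length))
    bp = 1.0 if c > r else math.exp(1 - r / c)
    # Geometric mean of the n-gram precisions, scaled by the brevity penalty.
    return bp * math.exp(sum(log_precisions) / max_n)


reference = "the cat is on the mat".split()
print(sentence_bleu(reference, [reference]))  # 1.0
```

Because this sketch applies no smoothing, a short candidate with no matching 4-gram scores 0. Established toolkits typically offer smoothing options and corpus-level aggregation, which are usually preferable for reported results.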
Provider:
Various



