EntailmentBank
arXiv, indexed 2025-09-30
Download link:
https://allenai.org/data/entailmentbank
Resource overview:
This dataset contains 339 entailment pairs for evaluating the consistency of a language model's beliefs. It is used to measure self-consistency and to evaluate the REFLEX method. The tasks covered are multiple-choice question answering and self-consistency assessment.



