WinoWhy

arXiv2025-09-30 收录

下载链接：

https://github.com/colinzhaoust/winowhy

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集名为WinoWhy，旨在评估自然语言处理中的常识推理能力，其核心任务是为给定的语境找出合理的解释。作者未提供训练集、验证集和测试集的划分，模型需在整个数据集上进行训练。具体任务为具有多个解释的演绎推理。

This dataset, named WinoWhy, is designed to evaluate commonsense reasoning capabilities in natural language processing (NLP). Its core task is to identify plausible explanations for a given context. The dataset's authors do not provide predefined training, validation, and test set splits, so models must be trained on the full dataset. The specific task is deductive reasoning with multiple candidate explanations.

5,000+

优质数据集

54 个

任务类型

进入经典数据集