SNLI Dataset

Name: SNLI Dataset
Creator: Papers with Code
License: 暂无描述

paperswithcode.com2025-01-21 收录

下载链接：

https://paperswithcode.com/dataset/snli

下载链接

链接失效反馈

官方服务：

资源简介：

The SNLI dataset (Stanford Natural Language Inference) consists of 570k sentence-pairs manually labeled as entailment, contradiction, and neutral. Premises are image captions from Flickr30k, while hypotheses were generated by crowd-sourced annotators who were shown a premise and asked to generate entailing, contradicting, and neutral sentences. Annotators were instructed to judge the relation between sentences given that they describe the same event. Each pair is labeled as “entailment”, “neutral”, “contradiction” or “-”, where “-” indicates that an agreement could not be reached.

斯坦福自然语言推断数据集（SNLI）包含57万对句子，这些句子由人工标注为包含蕴涵、矛盾和中立关系。其中，前提来自Flickr30k的图像标题，而假设句则由众包标注员生成，标注员在看到前提后需生成蕴涵、矛盾和中立的句子。标注员在判断句子之间的关系时，需假定它们描述的是同一事件。每对句子被标注为“蕴涵”、“中立”、“矛盾”或“-”，其中“-”表示无法达成一致。

提供机构：

Papers with Code

5,000+

优质数据集

54 个

任务类型

进入经典数据集