SNLI Dataset

Indexed from paperswithcode.com on 2025-01-21
Download link:
https://paperswithcode.com/dataset/snli
Description:
The SNLI (Stanford Natural Language Inference) dataset consists of 570k sentence pairs manually labeled as entailment, contradiction, or neutral. Premises are image captions from Flickr30k, while hypotheses were written by crowd-sourced annotators who were shown a premise and asked to write one entailing, one contradicting, and one neutral sentence. Annotators were instructed to judge the relation between the two sentences on the assumption that they describe the same event. Each pair is labeled “entailment”, “neutral”, “contradiction”, or “-”, where “-” indicates that the annotators could not reach agreement.
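Because some pairs carry the “-” label, a common first step is to drop them before training or evaluating a three-way classifier. The following is a minimal sketch of that filtering step, not an official loader: the field names `premise`, `hypothesis`, and `label` are assumptions chosen for illustration, and the four records are invented examples rather than rows from SNLI.

```python
from collections import Counter

# The three classes described above; "-" marks pairs without agreement.
VALID_LABELS = {"entailment", "neutral", "contradiction"}


def drop_unlabeled(pairs):
    """Keep only pairs whose label is one of the three classes (drops "-")."""
    return [p for p in pairs if p["label"] in VALID_LABELS]


# Hypothetical records, invented for illustration; not taken from SNLI.
# The keys "premise", "hypothesis", and "label" are assumed names.
pairs = [
    {"premise": "A man plays a guitar on stage.",
     "hypothesis": "A person is making music.",
     "label": "entailment"},
    {"premise": "A man plays a guitar on stage.",
     "hypothesis": "The man is asleep at home.",
     "label": "contradiction"},
    {"premise": "A man plays a guitar on stage.",
     "hypothesis": "The man is a famous musician.",
     "label": "neutral"},
    {"premise": "A man plays a guitar on stage.",
     "hypothesis": "The crowd is enjoying the show.",
     "label": "-"},
]

labeled = drop_unlabeled(pairs)
label_counts = Counter(p["label"] for p in labeled)
print(len(labeled), dict(label_counts))
```

When working with the real files, check the actual field names in the release you download and adjust the keys accordingly.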

Provider:
Papers with Code