斯坦福自然语言推理语料库
收藏arXiv2015-08-22 更新2024-07-25 收录
下载链接:
https://nlp.stanford.edu/projects/snli/
下载链接
链接失效反馈官方服务:
资源简介:
斯坦福自然语言推理语料库(SNLI)是一个大规模的标注语料库,由斯坦福语言学创建,包含570,152对人工编写的句子,用于自然语言推理研究。该数据集基于图像标注任务,旨在通过大规模资源推动机器学习在该领域的研究。SNLI数据集不仅规模庞大,且所有句子和标签均由人类在自然情境下编写,确保了数据的高质量和可靠性。此数据集适用于训练参数丰富的模型,如神经网络,以评估和提升自然语言推理能力,特别是在解决信息检索、语义解析和常识推理等任务中的应用。
The Stanford Natural Language Inference Corpus (SNLI) is a large-scale annotated corpus developed by Stanford Linguistics, containing 570,152 manually written sentence pairs for natural language inference research. Built upon image annotation tasks, this dataset aims to advance machine learning research in this field via large-scale resources. SNLI not only boasts a large scale, but all sentences and labels were written by humans in natural contexts, ensuring the high quality and reliability of the dataset. This corpus is suitable for training parameter-rich models such as neural networks to evaluate and enhance natural language inference capabilities, particularly for applications in tasks including information retrieval, semantic parsing, and commonsense reasoning.
提供机构:
斯坦福语言学
创建时间:
2015-08-22



