SNLI Corpus
Source: arXiv. Updated: 2017-03-27. Indexed: 2024-08-06.
Download link:
http://arxiv.org/abs/1607.06025v2
Description:
The SNLI Corpus is a natural language inference dataset of more than 500,000 examples. Each example consists of two human-written sentences, a premise and a hypothesis, and a label describing the relationship between them. The dataset is large enough to train effective neural network classifiers and has been used in a number of successful models. It was created to improve the performance of machine learning models on natural language understanding tasks by providing a large and diverse set of examples.
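The example structure described above (premise, hypothesis, label) can be sketched as a small record type. This is an illustrative sketch only: the class name `NLIExample`, its field names, and the sample sentences are hypothetical, and the three-way label set (entailment, contradiction, neutral) is assumed from the standard SNLI task rather than stated on this page.

```python
from collections import Counter
from dataclasses import dataclass

# Assumed label set: the standard three-way NLI labels, not listed on this page.
LABELS = ("entailment", "contradiction", "neutral")


@dataclass(frozen=True)
class NLIExample:
    """One premise/hypothesis pair with its relation label (hypothetical schema)."""
    premise: str
    hypothesis: str
    label: str

    def __post_init__(self):
        # Reject labels outside the assumed label set.
        if self.label not in LABELS:
            raise ValueError(f"unknown label: {self.label!r}")


# Made-up sentences for illustration; these are not taken from the corpus.
examples = [
    NLIExample("A man is playing a guitar on stage.",
               "A person is making music.", "entailment"),
    NLIExample("A man is playing a guitar on stage.",
               "The stage is empty.", "contradiction"),
    NLIExample("A man is playing a guitar on stage.",
               "The man is a famous musician.", "neutral"),
]

# Count how many examples carry each label.
label_counts = Counter(ex.label for ex in examples)
```

A classifier trained on such data would take the `premise` and `hypothesis` strings as input and predict the `label`.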
Provider:
Jožef Stefan Institute and Jožef Stefan International Postgraduate School, Slovenia
Created:
2016-07-21



