
QNLI Dataset

Source: paperswithcode.com, indexed 2025-03-26
Download link:
https://paperswithcode.com/dataset/qnli
Description:
The QNLI (Question-answering NLI) dataset is a Natural Language Inference dataset automatically derived from the Stanford Question Answering Dataset v1.1 (SQuAD). SQuAD v1.1 consists of question-paragraph pairs, where one sentence in the paragraph (drawn from Wikipedia) contains the answer to the corresponding question (written by an annotator). The dataset was converted into a sentence-pair classification task by pairing each question with each sentence in the corresponding context and then discarding pairs with low lexical overlap between the question and the context sentence. The task is to determine whether the context sentence contains the answer to the question. This modified version of the original task removes the requirement that the model select the exact answer, but it also removes the simplifying assumptions that the answer is always present in the input and that lexical overlap is a reliable cue. The QNLI dataset is part of the GLUE benchmark.
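
The conversion described above can be illustrated with a minimal Python sketch. It is not the actual GLUE preprocessing code: the function names, the regex-based sentence splitter, the token-overlap measure, the `min_overlap` threshold, and the label strings are all assumptions chosen for illustration.

```python
import re

def split_sentences(paragraph):
    """Naive splitter (an assumption): returns (start_offset, sentence) tuples."""
    return [(m.start(), m.group())
            for m in re.finditer(r"[^.!?]+[.!?]?", paragraph)
            if m.group().strip()]

def tokens(text):
    """Lowercased set of word tokens."""
    return set(re.findall(r"\w+", text.lower()))

def lexical_overlap(question, sentence):
    """Fraction of question tokens that also occur in the sentence."""
    q = tokens(question)
    return len(q & tokens(sentence)) / len(q) if q else 0.0

def build_pairs(question, paragraph, answer_start, min_overlap=0.2):
    """Pair the question with each context sentence, drop low-overlap pairs,
    and label each remaining pair by whether the sentence holds the answer.

    answer_start is the character offset of the answer in the paragraph,
    as in SQuAD-style annotations. min_overlap is a hypothetical threshold.
    """
    pairs = []
    for start, sent in split_sentences(paragraph):
        if lexical_overlap(question, sent) < min_overlap:
            continue  # discard pairs with low lexical overlap
        has_answer = start <= answer_start < start + len(sent)
        label = "entailment" if has_answer else "not_entailment"
        pairs.append((question, sent.strip(), label))
    return pairs

if __name__ == "__main__":
    paragraph = ("Paris is the capital of France. "
                 "France is in Europe. Bananas are yellow.")
    question = "What is the capital of France?"
    for pair in build_pairs(question, paragraph, paragraph.index("Paris")):
        print(pair)
```

In this toy example the third sentence shares no tokens with the question and is dropped, while the second sentence survives the filter despite not containing the answer. That is the kind of pair that makes lexical overlap an unreliable cue.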
Provider:
paperswithcode.com