SelQA

Name: SelQA
Creator: 埃默里大学
Published: 2016-10-28 09:20:19
License: 暂无描述

arXiv2016-10-28 更新2024-08-06 收录

下载链接：

http://arxiv.org/abs/1606.08513v3

下载链接

链接失效反馈

官方服务：

资源简介：

SelQA是一个新的选择题问答数据集，由埃默里大学创建。该数据集包含从英文维基百科的十大流行主题中提取的问题和答案，通过众包方式生成。数据集的创建过程涉及一系列众包任务，旨在减少问题和答案之间的词汇共现，从而生成大规模、多样化和具有挑战性的数据集。SelQA数据集主要用于评估开放领域问答系统的阅读理解能力，特别是在答案句选择和答案触发任务中。

SelQA is a novel multiple-choice question answering (QA) dataset created by Emory University. It consists of questions and answers extracted from ten popular topics in English Wikipedia, generated via crowdsourcing. The dataset construction involves a series of crowdsourcing tasks, which aim to reduce lexical co-occurrences between questions and answers, thereby producing a large-scale, diverse and challenging dataset. The SelQA dataset is primarily used to evaluate the reading comprehension capabilities of open-domain question answering systems, particularly in answer sentence selection and answer triggering tasks.

提供机构：

埃默里大学

创建时间：

2016-06-28

5,000+

优质数据集

54 个

任务类型

进入经典数据集