QADS
收藏arXiv2020-10-25 更新2024-08-06 收录
下载链接:
http://arxiv.org/abs/2010.13049v1
下载链接
链接失效反馈官方服务:
资源简介:
QADS是由南洋理工大学-阿里巴巴联合研究院创建的一个问答数据集,专注于机器阅读理解中同义词常识的处理。该数据集基于SQuAD 2.0,通过应用WordNet中的同义词知识自动生成问题。数据集包含超过5000个问题,旨在测试和改进机器理解模型处理同义词常识的能力。创建过程中,使用了增强的Lesk算法进行词义消歧,并通过众包工作者进行最终的验证和筛选。QADS的应用领域主要集中在提升机器阅读理解模型在处理复杂语言现象时的性能,特别是在理解和应用同义词常识方面。
QADS is a question answering dataset created by the Nanyang Technological University-Alibaba Joint Research Institute, which focuses on addressing synonym-related common sense in machine reading comprehension (MRC). Built upon SQuAD 2.0, this dataset automatically generates questions by leveraging synonym knowledge from WordNet. It contains over 5,000 questions, with the core goal of evaluating and enhancing the capability of machine comprehension models to process synonym-related common sense. During the dataset development process, an enhanced Lesk algorithm was adopted for word sense disambiguation (WSD), and final validation and screening were performed by crowdsourcing workers. The primary application scenarios of QADS center on boosting the performance of MRC models when dealing with complex linguistic phenomena, especially in understanding and applying synonym-related common sense.
提供机构:
南洋理工大学-阿里巴巴联合研究院
创建时间:
2020-10-25
搜集汇总
数据集介绍

背景与挑战
背景概述
QADS是由南洋理工大学-阿里巴巴联合研究院创建的问答数据集,专注于机器阅读理解中同义词常识的处理。它基于SQuAD 2.0,通过WordNet自动生成超过5000个问题,旨在测试和改进模型处理同义词常识的能力,应用领域包括提升机器阅读理解模型在复杂语言现象中的性能。
以上内容由遇见数据集搜集并总结生成



