MTL-QA : A dataset and multi-task learning approach for knowledge graph and natural language question answering

NIAID Data Ecosystem2026-03-14 收录

下载链接：

https://zenodo.org/record/7456299

下载链接

链接失效反馈

官方服务：

资源简介：

The dataset used for this project is created by enhancing the publicly available MetaQA (Movie Text Audio QA), which is primarily a KGQA dataset pertaining to movies, an extension of WikiMovies. This involves questions requiring 1, 2, and 3 hops which can be answered by using a MetaQA Knowledge Graph. The questions are available in text and audio format. The text has vanilla (original) and its paraphrased version, and is called ntm. In order to develop a dataset to support NLQA, a series of dataset augmentation steps has been performed. The dataset consists of natural language questions and a tagged topic entity as ground truth. This topic entity is used to retrieve textual information related to the question from Wikipedia. The introduction section of the entity's page is used as the context that is required for NLQA. Hence, this dataset has information related to both KGQA and NLQA. Certain preliminary checks and validations are done to only retain those data samples whose context can be used to answer a given question.

创建时间：

2022-12-19

5,000+

优质数据集

54 个

任务类型

进入经典数据集