T-Rex : A Large Scale Alignment of Natural Language with Knowledge Base Triples

Name: T-Rex : A Large Scale Alignment of Natural Language with Knowledge Base Triples
Creator: figshare
Published: 2020-09-02 06:59:39
License: 暂无描述

DataCite Commons2020-09-02 更新2024-07-25 收录

下载链接：

https://figshare.com/articles/dataset/T-Rex_A_Large_Scale_Alignment_of_Natural_Language_with_Knowledge_Base_Triples/5146864

下载链接

链接失效反馈

官方服务：

资源简介：

Several datasets with alignments between knowledge base triples and free text have been built, for several independent tasks, such as Relation Extraction, Knowledge base population, Relation Discovery…But where the others datasets have a small number of documents (TAC-KBP), only have the relations without the original documents (FB15K-237), have a little amount of relations (Google-RE), we present a dataset containing large-scale high-quality alignments between DBpedia abstracts and Wikidata triples, T-REx.

目前已针对关系抽取（Relation Extraction）、知识库填充（Knowledge base population）、关系发现（Relation Discovery）等多项独立任务，构建了若干实现知识库三元组（knowledge base triples）与自由文本（free text）对齐的数据集。然而现有同类数据集均存在不同程度的局限：TAC-KBP的文档数量有限，FB15K-237仅包含关系而无原始文本，Google-RE的关系体量偏小。为此，我们提出了T-REx数据集，该数据集包含DBpedia摘要与Wikidata三元组之间的大规模高质量对齐数据。

提供机构：

figshare

创建时间：

2017-06-27

5,000+

优质数据集

54 个

任务类型

进入经典数据集