Data from: Parser extraction of triples in unstructured text

Name: Data from: Parser extraction of triples in unstructured text
Creator: Dryad
Published: 2025-04-01 05:07:18
License: 暂无描述

DataCite Commons2025-04-01 更新2025-04-10 收录

下载链接：

https://datadryad.org/dataset/doi:10.5061/dryad.s7j17qp

下载链接

链接失效反馈

官方服务：

资源简介：

The web contains vast repositories of unstructured text. We investigate the opportunity for building a knowledge graph from these text sources. We generate a set of triples which can be used in knowledge gathering and integration. We define the architecture of a language compiler for processing subject-predicate-object triples using the OpenNLP parser. We implement a depth-first search traversal on the POS tagged syntactic tree appending predicate and object information. A parser enables higher precision and higher recall extractions of syntactic relationships across conjunction boundaries. We are able to extract 2-2.5 times the correct extractions of ReVerb. The extractions are used in a variety of semantic web applications and question answering. We verify extraction of 50,000 triples on the ClueWeb dataset.

提供机构：

Dryad

创建时间：

2019-04-15

5,000+

优质数据集

54 个

任务类型

进入经典数据集