five

Constructing and Cleaning Identity Graphs in the LOD Cloud

收藏
科学数据银行2020-10-17 更新2026-04-23 收录
下载链接:
https://www.scidb.cn/en/detail?dataSetId=766941569981874176
下载链接
链接失效反馈
官方服务:
资源简介:
Five tables and six figures of the paper. Table 1 shows evaluation of 200 owl:sameAs links, with each 40 links randomly chosen from a certain range of error degree. The percentages between parentheses are calculated without considering the links evaluated as “can’t tell”. Table 2 presents accuracy of the approach on the four manually evaluated samples, based on a threshold of 0.99. Links evaluated as “can’t tell” by the judges are discarded. Table 3 shows precision, recall and accuracy, based on two thresholds (0.99 and 0.4) for the Barack Obama identity set. Links evaluated as “can’t tell” by the judges are discarded.Table 4 is comparison of the original identity network closure, the closure (b) and (c), with the Gold Standard. Table 5 shows precision, recall and accuracy evaluation of the three closures. Figure 1 is the workflow of the identity network extraction, compaction and closure. Figure 2 depicts the distribution of identity set cardinality in Gim. The x-axis lists all 48,999,148 non-singleton identity sets. Figure 3 shows error degree distribution of all owl:sameAs statements in the LOD-a-lot. Figure 4 is comparison of the original identity network and its transitive closure, with the two newly constructed identity subgraphs.Figure 5 shows‘Barack Obama’ identity cluster. Figure 6 presents community structure of the‘Barack Obama’ identity cluster.
提供机构:
Vrije University; Paris Saclay University
创建时间:
2020-10-17
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作