WN18RR
DataCite Commons, updated 2020-08-25, indexed 2024-08-18
Download link:
https://figshare.com/articles/WN18RR/11911272
Description:
WN18RR is derived from WN18, with data removed to eliminate the test-set leakage caused by inverse relations. WN18RR contains 93,003 triples connecting 40,943 entities via 11 relations.

Info about WN18: This WORDNET TENSOR DATA consists of a collection of triplets (synset, relation_type, synset) extracted from WordNet 3.0 (http://wordnet.princeton.edu). The data set can be seen as a 3-mode tensor depicting ternary relationships between synsets.
All *.txt files contain one triplet per line, with two synset_ids and a relation type identifier in tab-separated format. The first element is the synset_id of the left-hand side of the relation triple, the second element is the name of the relation type between the two synsets, and the third element is the synset_id of the right-hand side.
The WN18RR.zip file bundles the other files and is compressed more tightly than the default "download all" archive.
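The tab-separated layout described above can be read with a few lines of Python. The following is a minimal sketch, not an official loader: the function names `parse_triples` and `summarize` are hypothetical, and the sample lines use made-up placeholder IDs and relation names purely to show the column order (left synset_id, relation type, right synset_id).

```python
from typing import Dict, Iterable, List, Tuple

Triple = Tuple[str, str, str]  # (left synset_id, relation type, right synset_id)


def parse_triples(lines: Iterable[str]) -> List[Triple]:
    """Parse tab-separated triples, one per line, skipping blank lines."""
    triples: List[Triple] = []
    for line_no, line in enumerate(lines, start=1):
        if not line.strip():
            continue  # tolerate empty lines, e.g. at the end of a file
        parts = line.rstrip("\r\n").split("\t")
        if len(parts) != 3:
            raise ValueError(
                f"line {line_no}: expected 3 tab-separated fields, got {len(parts)}"
            )
        head, relation, tail = (p.strip() for p in parts)
        triples.append((head, relation, tail))
    return triples


def summarize(triples: List[Triple]) -> Dict[str, int]:
    """Count triples, distinct entities, and distinct relation types."""
    entities = {h for h, _, _ in triples} | {t for _, _, t in triples}
    relations = {r for _, r, _ in triples}
    return {
        "triples": len(triples),
        "entities": len(entities),
        "relations": len(relations),
    }


# Placeholder lines (assumed values, not taken from the dataset).
sample = [
    "id_1\trel_a\tid_2\n",
    "id_2\trel_b\tid_3\n",
    "\n",
]
triples = parse_triples(sample)
stats = summarize(triples)
print(stats)
```

Because `parse_triples` accepts any iterable of strings, an open file handle for one of the *.txt files can be passed to it directly. Applying `summarize` to the triples from all of the files together would be one way to check the counts quoted in the description.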
Provider:
figshare
Created:
2020-02-27
Compiled summary
Dataset introduction

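Background and challenges
Background overview
WN18RR is a knowledge graph dataset based on WordNet 3.0. It contains 93,003 triples that connect 40,943 entities through 11 relations. The dataset addresses the test-set leakage present in WN18 by removing the data affected by inverse relations. It is stored as plain text, one triple per line.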
The content above was collected and summarized by 遇见数据集.
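To make the leakage mentioned in the background overview concrete, the sketch below flags test triples whose entity pair already appears in the training set in the opposite direction. This is only an illustration on toy data: the function `inverse_leaks`, the entity names, and the relation names are assumptions for the example, and this is not claimed to be the procedure used to construct WN18RR.

```python
from typing import List, Tuple

Triple = Tuple[str, str, str]  # (head, relation, tail)


def inverse_leaks(train: List[Triple], test: List[Triple]) -> List[Triple]:
    """Return test triples (h, r, t) for which training contains some (t, r', h)."""
    # For every training triple (h, r, t), record the reversed pair (t, h).
    reversed_pairs = {(t, h) for h, _, t in train}
    return [(h, r, t) for h, r, t in test if (h, t) in reversed_pairs]


# Toy data (assumed names, not taken from the dataset).
train = [
    ("dog", "is_a", "animal"),
    ("wheel", "part_of", "car"),
]
test = [
    ("animal", "has_kind", "dog"),  # reverse of a training pair: trivially inferable
    ("car", "is_a", "vehicle"),     # no reversed counterpart in training
]

leaks = inverse_leaks(train, test)
print(leaks)
```

A model that has memorized the pair (`dog`, `animal`) from training can answer the first test triple without learning anything about the relation itself, which is the kind of shortcut that the data removal in WN18RR is meant to prevent.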



