Entity Relatedness Test Data
收藏DataCite Commons2020-09-01 更新2024-07-27 收录
下载链接:
https://springernature.figshare.com/articles/dataset/Entity_Relatedness_Test_Data/5234701/1
下载链接
链接失效反馈官方服务:
资源简介:
The entity relatedness problem refers to the question of computing the relationship paths that better describe the connectivity between a given entity pair. <br>This dataset supports the evaluation of approaches that address the entity relatedness problem. It covers two familiar domains, music and movies, and uses data available in IMDb and last.fm, which are popular reference datasets in these domains. <br>The dataset contains 20 entity pairs from each of these domains and, for each entity pair, a ranked list with 50 relationship paths. It also contains entity ratings and property relevance scores for the entities and properties used in the paths.<br>The data is compressed in .zip format and can be uncompressed by standard compression utilities. The data are split into three archives:<br><b>EntityRelatednessTestData to RDF.zip:</b> contains raw (.txt) and rdf test data along with test scripts (.java) and java class (.class) files. <br><b><br></b><b>ontology.zip: </b>contains the .rdf ontology for the entity relatedness test dataset<br><br><b>dataset.zip: </b>contains the entity relatedness test dataset in .rdf, .ttl and .nt formats<br>The underlying data and code can be accessed through standard text edit software.<br>
实体关联问题(Entity Relatedness Problem)指的是求解能够更恰当地描述给定实体对之间连通性的关系路径的研究问题。
本数据集用于为实体关联问题相关方法的评估提供支撑。其覆盖音乐与电影两大主流领域,采用了这两大领域中广受认可的参考数据集互联网电影数据库(IMDb)与last.fm中的公开数据。
该数据集在每个领域中均包含20组实体对,且针对每一组实体对,提供了包含50条关系路径的排序列表。此外,数据集还涵盖了路径中所使用的实体与属性的评分、属性相关性得分。
本数据集以.zip格式进行压缩,可通过标准压缩工具完成解压。数据分为三个归档文件:
**EntityRelatednessTestData to RDF.zip**:包含原始(.txt)测试数据与资源描述框架(RDF)测试数据,同时附带测试脚本(.java)及Java类(.class)文件。
**ontology.zip**:包含本实体关联测试数据集所使用的资源描述框架(RDF)格式本体文件。
**dataset.zip**:包含以.rdf、.ttl及.nt格式存储的实体关联测试数据集。
相关基础数据与代码可通过标准文本编辑软件访问。
提供机构:
figshare
创建时间:
2017-07-26



