WIKES (Wiki Entity Summarization Benchmark)

Source: arXiv. Updated: 2024-06-13. Indexed: 2024-06-14.
Download link:
https://github.com/msorkhpar/wiki-entity-summarization
Overview:
WIKES is a comprehensive knowledge graph entity summarization benchmark created by institutions including the Polytechnic University of Turin. The dataset contains approximately 500 seed nodes and is extracted from Wikidata and Wikipedia with a random walk method, which preserves the complexity of real-world knowledge graphs. WIKES uses Wikipedia abstracts to generate high-quality, unbiased entity summaries automatically, with no manual annotation required, and it is applicable across multiple domains. It aims to address three shortcomings of existing datasets: small scale, dependence on manual annotation, and neglect of graph structural information.
Provided by:
Polytechnic University of Turin
Created:
2024-06-13
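
The overview mentions extraction by random walk. As a rough illustration only, the sketch below samples a subgraph by running short random walks from a seed node. It is not the WIKES pipeline: the function name random_walk_subgraph, its parameters, and the toy graph with its entity and relation labels are all hypothetical and were made up for this example.

```python
import random


def random_walk_subgraph(adjacency, seeds, walk_length, walks_per_seed, rng):
    """Collect the edges visited by short random walks started at each seed.

    Hypothetical helper for illustration; not taken from the WIKES code base.
    adjacency maps a node to a list of (relation, neighbor) pairs.
    """
    edges = set()
    for seed in seeds:
        for _ in range(walks_per_seed):
            node = seed
            for _ in range(walk_length):
                neighbors = adjacency.get(node, [])
                if not neighbors:
                    break  # dead end: this walk stops early
                relation, nxt = rng.choice(neighbors)
                edges.add((node, relation, nxt))
                node = nxt
    return edges


# Toy graph with invented labels (not real Wikidata identifiers).
TOY_GRAPH = {
    "seed": [("r1", "a"), ("r2", "b")],
    "a": [("r3", "c")],
    "b": [("r4", "d")],
    "c": [],
    "d": [],
}

# A fixed-seed generator keeps the sample reproducible between runs.
sampled = random_walk_subgraph(
    TOY_GRAPH, ["seed"], walk_length=2, walks_per_seed=20, rng=random.Random(0)
)
for edge in sorted(sampled):
    print(edge)
```

Longer walks or more walks per seed reach more of the neighborhood around each seed, at the cost of a larger sampled subgraph.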