five

Dataset for paper: " Knowledge Graph Embeddings based Approach for Author Name Disambiguation using Literals"

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/5675786
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset consists in two distinct scholarly knowledge graph created from two publicly available bibliographic datasets: 1) a triplestore covering information about the journal Scientometrics provided by OpenCitations (available here), and 2) the AMiner AND benchmark from 2018 available here. This KG was extracted for a research project on knowledge graph embeddings (KGEs) for author disambiguation. Structural triples of the knowledge graphs are split into training, testing and validation for applying representation learning methods. Textual literals and numeric literals were stored separately in order to implement multimodal approaches for KGEs (see arXiv:1802.00934). For the same reason, textual literals and numeric literals are already stored into sentence embeddings and a numeric matrix respectively in the files textual_literals.npy and numeric_literals.npy in order to simplify the representation learning task. The file and_eval.json of each KG contains the evaluation dataset used for evaluating our AND architecture. For the script used to gather this dataset see https://github.com/sntcristian/and-kge/tree/main/src/AMiner-534K and https://github.com/sntcristian/and-kge/tree/main/src/OC-782K.
创建时间:
2022-04-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作