DBLP-Scholar
收藏arXiv2025-09-30 收录
下载链接:
https://dbs.uni-leipzig.de/file/DBLP-Scholar.zip
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为DS,其中包含了来自DBLP和谷歌学术的出版物实体。在实验中,我们通过以下属性将DBLP条目与学术条目进行匹配:标题、作者和年份,同时采用Jaccard相似度进行属性匹配,以及使用Jaro-Winkler距离对标题、作者和出版物场地进行相似度计算。该数据集规模包含10,482对条目,其中4,771对为等效条目。所执行的任务是实体解析。
This dataset is named DS, which contains publication entities sourced from DBLP and Google Scholar. In the experiments, we matched DBLP entries against Google Scholar entries using the following attributes: title, author, and year. We utilized Jaccard similarity for attribute matching, and Jaro-Winkler distance to compute the similarity of title, author, and publication venue. This dataset includes a total of 10,482 entry pairs, of which 4,771 pairs are equivalent entity pairs. The task undertaken here is entity resolution.
提供机构:
University of Leipzig



