Paris-area Assignee Benchmark
收藏DataCite Commons2020-09-04 更新2024-07-25 收录
下载链接:
https://figshare.com/articles/dataset/Paris-area_Assignee_Benchmark/3502766
下载链接
链接失效反馈官方服务:
资源简介:
The manual disambiguation of EPO/PCT assignees in the Paris area, used as one of the benchmarks in the paper. In all columns, if more than one ID is found, the elements are comma separated. 5 columns, 18877 rows, "|" delimited; the columns are:--Pub_number: the publication number of the patent--ManualIDs: The manually disambiguated IDs of each inventor on the patent. Each ID has the following structure: (1) a leading character of "r", indicating within-region, or "e", indicating an external collaborator; (2) a number with specific meaning; (3) an "\_"; and (4) a manual classification of the type of institution. The institution can have the values "university", "hospital", "company", "lab", and "other".--OurIDs: the output of our algorithm on this patent. --HanIDs: The HAN IDs for this patent. --RawNames: the undisambiguated names on the patent, with case, punctuation, and spacing dropped. Note that there is no correspondence between the boston-area IDs and the paris-area IDs. Do not attempt to mix the two disambiguations together as incorrect ID-clashes are possible.
提供机构:
figshare
创建时间:
2017-04-04



