Boston-area Assignee Benchmark
收藏Figshare2017-04-04 更新2026-04-08 收录
下载链接:
https://figshare.com/articles/dataset/Boston-area_Assignee_Benchmark/3502757/1
下载链接
链接失效反馈官方服务:
资源简介:
The manual disambiguation of EPO/PCT assignees in the Boston area, used as one of the benchmarks in the paper. In all columns, if more than one ID is found, the elements are comma separated. 5 columns, 22528 rows, "|" delimited; the columns are--Pub\_number: the publication number of the patent--ManualIDs: The manually disambiguated IDs of each inventor on the patent. Each ID has the following structure: (1) a leading character of "r", indicating within-region, or "e", indicating an external collaborator; (2) a number with no specific meaning; (3) an "\_"; and (4) a manual classification of the type of institution. The institution can have the values "university", "hospital", "company", "lab", and "other".--OurIDs: the output of our algorithm on this patent. --HanIDs: The HAN IDs for this patent. --RawNames: the undisambiguated names on the patent, with case, punctuation, and spacing dropped. Note that there is no correspondence between the boston-area IDs and the paris-area IDs. Do not attempt to mix the two disambiguations together as incorrect ID-clashes are possible.
创建时间:
2017-04-04



