
Boston-area Assignee Benchmark

Source: Figshare. Updated 2017-04-04. Indexed 2026-04-08.
Download link:
https://figshare.com/articles/dataset/Boston-area_Assignee_Benchmark/3502757/1
Description:
The manual disambiguation of EPO/PCT assignees in the Boston area, used as one of the benchmarks in the paper. The file has 5 columns and 22528 rows and is "|" delimited. In all columns, if more than one ID is found, the elements are comma separated.

The columns are:
- Pub_number: the publication number of the patent.
- ManualIDs: the manually disambiguated IDs of each inventor on the patent. Each ID has the following structure: (1) a leading character of "r", indicating within-region, or "e", indicating an external collaborator; (2) a number with no specific meaning; (3) an "_"; and (4) a manual classification of the type of institution. The institution can have the values "university", "hospital", "company", "lab", and "other".
- OurIDs: the output of our algorithm on this patent.
- HanIDs: the HAN IDs for this patent.
- RawNames: the undisambiguated names on the patent, with case, punctuation, and spacing dropped.

Note that there is no correspondence between the Boston-area IDs and the Paris-area IDs. Do not attempt to mix the two disambiguations together, as incorrect ID clashes are possible.
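The layout described above could be read along the following lines. This is only a minimal sketch under stated assumptions: the description does not say whether the file has a header row or uses quoting, so the reader below skips a header if one is present and treats every "|" as a delimiter. The function names (split_cell, parse_manual_id, read_benchmark) and the two sample rows are invented for illustration and are not taken from the dataset.

```python
import csv
import io
import re

# Column order as given in the dataset description.
COLUMNS = ["Pub_number", "ManualIDs", "OurIDs", "HanIDs", "RawNames"]

# One manual ID: 'r' or 'e', a number, '_', then the institution type.
MANUAL_ID = re.compile(r"([re])(\d+)_(university|hospital|company|lab|other)")


def split_cell(cell):
    """Split a comma-separated cell into stripped, non-empty elements."""
    return [part.strip() for part in cell.split(",") if part.strip()]


def parse_manual_id(raw):
    """Break a manual ID into its documented parts."""
    match = MANUAL_ID.fullmatch(raw)
    if match is None:
        raise ValueError(f"unexpected manual ID: {raw!r}")
    scope, number, institution = match.groups()
    return {
        "within_region": scope == "r",  # 'e' marks an external collaborator
        "number": number,               # kept as text; it carries no meaning
        "institution": institution,
    }


def read_benchmark(handle):
    """Yield one dict per patent from a '|'-delimited text stream."""
    # Assumption: no quoting is used, so every '|' is a delimiter.
    reader = csv.reader(handle, delimiter="|", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) != len(COLUMNS):
            continue  # skip blank or malformed lines
        if row[0].strip() == COLUMNS[0]:
            continue  # assumption: a header row, if present, repeats the names
        pub, manual, ours, han, names = row
        yield {
            "pub_number": pub.strip(),
            "manual_ids": [parse_manual_id(x) for x in split_cell(manual)],
            "our_ids": split_cell(ours),
            "han_ids": split_cell(han),
            "raw_names": split_cell(names),
        }


# Invented sample rows, for illustration only (not real dataset content).
SAMPLE = (
    "Pub_number|ManualIDs|OurIDs|HanIDs|RawNames\n"
    "EP0000001|r12_company,e7_university|a1,a2|h1,h2|acmecorp,exampleuniversity\n"
    "EP0000002|r3_hospital|a3|h3|examplehospital\n"
)

records = list(read_benchmark(io.StringIO(SAMPLE)))
for record in records:
    print(record["pub_number"], record["manual_ids"])
```

For the real file, the same function could be given an open file handle, for example open(path, newline="", encoding="utf-8"), where path and the encoding are likewise assumptions. Because the Boston-area and Paris-area IDs do not correspond, records from the two benchmarks are best kept in separate collections.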
Created:
2017-04-04