five

Handwritten Species Names Data

收藏
Mendeley Data2024-03-27 更新2024-06-29 收录
下载链接:
https://zenodo.org/record/2545573
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset is part of a paper presented at the 41st European Conference on Information Retrieval ,14th – 18th April 2019, in Cologne. DOI: https://doi.org/10.1007/978-3-030-15712-8_43 Data summary: Word images from 240 field notes from a natural history collection have been segmented and semantically annotated. This has been carried out in the context of the project ''Making Sense of Illustrated Handwritten Archives'', http://www.makingsenseproject.org/. From a field book on mammals, field notes from four different writers have been selected, to account for different handwriting styles and structures. The segmented word images were obtained from a nichesourcing effort, with the help of a group of domain expert labellers and a handwriting recognition system MONK, developed by Lambert Schomaker. The word images were subsequently manually annotated using four classes: Genus (0), Species(1), Author(2) and Other (3). Dataframe fields: rel_xc (relative centroid x coordinate), rel_yc (relative centroid y coordinate), page (identifier of field book page), rel_x1 (relative left x coordinate bounding box), rel_x2 (relative right x coordinate bounding box), rel_y1 (relative upper y coordinate bounding box), rel_y2 (relative lower y coordinate bounding box), type (class label, 0-3), image (pixels of word image), height (height word image), size (height * width word image), width (width word image).
创建时间:
2023-06-28
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作