Extended_CLEF_IP_2011_Dataset
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/10019327
下载链接
链接失效反馈官方服务:
资源简介:
Extended CLEF-IP 2011 Dataset: We use the 2011 benchmark dataset of CLEF-IP [23] for visualization type classification. However, it does not cover block and
circuit diagrams, which are an important type of visualization frequently used in patents. We added this category as a tenth class and collected images by querying EPO's publication server (https://data.epo.org/publication-server/). Finally, we manually annotated images depicting block and circuit diagrams. The dataset statistics are provided in Table 1 (left).
Paper title: Classification of Visualization Types and Perspectives in Patents
Paper link: https://link.springer.com/chapter/10.1007/978-3-031-43849-3_16
Git repository link: https://github.com/TIBHannover/PatentImageClassification
Cite as:
Ghauri, J.A., Müller-Budack, E., Ewerth, R. (2023). Classification of Visualization Types and Perspectives in Patents. In: Alonso, O., Cousijn, H., Silvello, G., Marrero, M., Teixeira Lopes, C., Marchesin, S. (eds) Linking Theory and Practice of Digital Libraries. TPDL 2023. Lecture Notes in Computer Science, vol 14241. Springer, Cham. https://doi.org/10.1007/978-3-031-43849-3_16
OR
@InProceedings{10.1007/978-3-031-43849-3_16,
author="Ghauri, Junaid Ahmed
and M{\"u}ller-Budack, Eric
and Ewerth, Ralph",
editor="Alonso, Omar
and Cousijn, Helena
and Silvello, Gianmaria
and Marrero, M{\'o}nica
and Teixeira Lopes, Carla
and Marchesin, Stefano",
title="Classification of Visualization Types and Perspectives in Patents",
booktitle="Linking Theory and Practice of Digital Libraries",
year="2023",
publisher="Springer Nature Switzerland",
address="Cham",
pages="182--191",
abstract="Due to the swift growth of patent applications each year, information and multimedia retrieval approaches that facilitate patent exploration and retrieval are of utmost importance. Different types of visualizations (e.g., graphs, technical drawings) and perspectives (e.g., side view, perspective) are used to visualize details of innovations in patents. The classification of these images enables a more efficient search in digital libraries and allows for further analysis. So far, datasets for image type classification miss some important visualization types for patents. Furthermore, related work does not make use of recent deep learning approaches including transformers. In this paper, we adopt state-of-the-art deep learning methods for the classification of visualization types and perspectives in patent images. We extend the CLEF-IP dataset for image type classification in patents to ten classes and provide manual ground truth annotations. In addition, we derive a set of hierarchical classes from a dataset that provides weakly-labeled data for image perspectives. Experimental results have demonstrated the feasibility of the proposed approaches. Source code, models, and datasets are publicly available (https://github.com/TIBHannover/PatentImageClassification).",
isbn="978-3-031-43849-3"
}
创建时间:
2023-10-19



