five

Recognizing Figure Labels in Patents [AAAI 2021 SDU Workshop]

收藏
DataCite Commons2025-06-01 更新2024-07-28 收录
下载链接:
https://figshare.com/articles/dataset/Recognizing_Figure_Labels_in_Patents_AAAI_2021_SDU_Workshop_/13416311/1
下载链接
链接失效反馈
官方服务:
资源简介:
* 100patents_design_original_png.zip: 100 figures extracted from 100 US DESIGN patents, original (unrotated), PNG format * 100patents_design_original-tif.zip: 100 figures extracted from 100 US DESIGN patents, original (unrotated), TIF format* 100patents_design_rotated-tif.zip: 100 figures extracted from 100 US DESIGN patents, rotated to upright if needed TIF format* 100patents_design_rotated-png.zip: 100 figures extracted from 100 US DESIGN patents, rotated to upright if needed, PNG format* all_results_labels.xml: ground truth labels and labels extracted by 8 tools: SWT, Adobe Acrobat, EAST, Amazon Textract, Tesseract, Google Vision API, Abbyy, and the alpha-shape method.<br>For details, see the following paper: Gong Ming, Xin Wei, Diane Oyen, Jian Wu, Martin Gryder, and Liping Yang. "Recognizing Figure Labels in Patents." In: AAAI-2021 workshop on Scientific Document Understanding (SDU). Virtual Event.

* 100patents_design_original_png.zip:从100项美国外观设计专利中提取的100幅原图(未旋转),PNG格式压缩包。 * 100patents_design_original-tif.zip:从100项美国外观设计专利中提取的100幅原图(未旋转),TIF格式压缩包。 * 100patents_design_rotated-tif.zip:从100项美国外观设计专利中提取的100幅图像,可根据需要旋转至正向姿态,TIF格式压缩包。 * 100patents_design_rotated-png.zip:从100项美国外观设计专利中提取的100幅图像,可根据需要旋转至正向姿态,PNG格式压缩包。 * all_results_labels.xml:包含基准真值标签与8种工具提取的标签的XML文件,这8种工具分别为:笔画宽度变换(Stroke Width Transform, SWT)、Adobe Acrobat、EAST、Amazon Textract、Tesseract、Google Vision API、Abbyy以及alpha形状(alpha-shape)方法。<br>详细内容请参阅以下论文:龚明、辛伟、Diane Oyen、吴健、Martin Gryder、杨立平。《专利中的图像标签识别》,发表于:AAAI-2021科学文献理解(SDU)研讨会,虚拟会议。
提供机构:
figshare
创建时间:
2020-12-19
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作