Nexdata/4601_Images_22_Kinds_of_Bills_OCR_Data
Hugging Face | Updated 2024-04-11 | Indexed 2024-06-12
Download link:
https://hf-mirror.com/datasets/Nexdata/4601_Images_22_Kinds_of_Bills_OCR_Data
Resource description:
---
license: cc-by-nc-nd-4.0
---
## Description
4,601 images of 22 kinds of bills, collected for OCR. The images have solid-color backgrounds, and the bills come from multiple provinces. Each text line is annotated with a quadrilateral bounding box and a line-level transcription. The data can be used for tasks such as bill OCR.
For more details, please refer to the link: https://www.nexdata.ai/dataset/1028?source=Huggingface
# Specifications
## Data size
4,601 images, 22 kinds
## Collection environment
solid-color background
## Data diversity
including multiple types of bills, multiple provinces
## Device
cellphone
## Image Parameter
images are in .jpg format; annotation files are in .json format
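
Since images and annotations ship as separate .jpg and .json files, a loader first has to match them up. The following is a minimal sketch that assumes each image has an annotation file with the same file stem somewhere under the same root directory; the card does not state the actual directory layout, and `pair_images_with_annotations` is a hypothetical helper name.

```python
from pathlib import Path


def pair_images_with_annotations(root):
    """Return (image_path, annotation_path) pairs that share a file stem.

    Assumption: every .jpg has a .json with the same stem somewhere under
    `root`. The real archive layout may differ.
    """
    root = Path(root)
    # Index annotation files by stem so each image lookup is O(1).
    annotations = {p.stem: p for p in root.rglob("*.json")}
    pairs = []
    for image in sorted(root.rglob("*.jpg")):
        annotation = annotations.get(image.stem)
        if annotation is not None:
            pairs.append((image, annotation))
    return pairs
```

Images without a matching annotation are skipped silently here; a stricter loader might prefer to raise an error instead.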
## Annotation content
line-level quadrilateral bounding boxes and line-level transcription of the text
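
To make the annotation content concrete, here is a minimal sketch of reading one annotation record. The JSON schema is an assumption made for illustration: the keys `image`, `lines`, `points`, and `text`, and the sample values, are hypothetical and are not taken from the dataset's actual files.

```python
import json

# Hypothetical annotation record: one text line with four vertices
# and its transcription. The real .json schema may differ.
SAMPLE = """
{
  "image": "example.jpg",
  "lines": [
    {"points": [[10, 20], [110, 22], [109, 48], [9, 46]], "text": "EXAMPLE LINE"}
  ]
}
"""


def load_lines(annotation_text):
    """Parse a record into a list of (quadrilateral, transcription) tuples."""
    record = json.loads(annotation_text)
    lines = []
    for item in record["lines"]:
        quad = [tuple(point) for point in item["points"]]
        if len(quad) != 4:
            raise ValueError("expected four vertices per text line")
        lines.append((quad, item["text"]))
    return lines


def bounding_rect(quad):
    """Axis-aligned rectangle (x_min, y_min, x_max, y_max) enclosing a quadrilateral."""
    xs = [x for x, _ in quad]
    ys = [y for _, y in quad]
    return min(xs), min(ys), max(xs), max(ys)
```

Converting each quadrilateral to an axis-aligned rectangle is only useful for models that expect rectangular boxes; the quadrilateral itself preserves the skew of text photographed at an angle.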
## Accuracy
an annotation counts as qualified when the error of each vertex of the quadrilateral bounding box is within 5 pixels
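
The 5-pixel vertex tolerance can be expressed as a simple check. This is a sketch under stated assumptions: the card does not say how the error is measured, so the code assumes Euclidean distance between corresponding vertices and treats exactly 5 pixels as acceptable. The names `vertex_errors` and `within_tolerance` are hypothetical.

```python
import math

TOLERANCE_PX = 5.0  # tolerance stated in the dataset card


def vertex_errors(annotated, reference):
    """Euclidean distance between corresponding vertices of two quadrilaterals.

    Assumption: both quadrilaterals list their four vertices in the same order.
    """
    if len(annotated) != 4 or len(reference) != 4:
        raise ValueError("each quadrilateral needs exactly four vertices")
    return [math.dist(a, r) for a, r in zip(annotated, reference)]


def within_tolerance(annotated, reference, tolerance=TOLERANCE_PX):
    """True when every vertex lies within `tolerance` pixels of its reference."""
    return all(error <= tolerance for error in vertex_errors(annotated, reference))
```

For example, a vertex displaced by (3, 4) pixels has an error of exactly 5 and passes, while one displaced by (4, 4) has an error of about 5.66 and fails.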
# Licensing Information
Commercial License
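Provider:
Nexdata
Summary of original information
Dataset overview
Dataset contents
- Number and kinds of images: 4,601 images covering 22 different kinds of bills.
- Background: all images have a solid-color background.
- Data diversity: multiple kinds of bills from multiple provinces.
Technical specifications
- Device: the data was collected with cellphones.
- Image format: image files are in .jpg format; annotation files are in .json format.
- Annotation content: line-level quadrilateral bounding boxes and line-level text transcription.
- Accuracy: the error of each vertex of the quadrilateral bounding box is within 5 pixels.
License
- License type: commercial license.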



