OCR Recognition Package
华东江苏大数据交易中心 (East China Jiangsu Big Data Exchange Center) | Updated 2022-11-18 | Indexed 2024-03-01
Download link:
http://www.hddatapay.com/dataProductInfo/Details/121
Resource overview:
The package applies broadly to photocopied documents, images, and photos. It filters out interference such as image noise, stamps, and stains, with a stated overall recognition rate above 99%.
It recognizes and reads the text of common certificates, including ID cards, driver's licenses, household registers, business licenses, and real estate certificates.
It reads text from tables and converts unstructured tabular data into structured formats such as key-value pairs, Excel, and JSON. It handles several kinds of tabular documents, including very long tables that span pages, tables with solid borders, and tables with dashed borders.
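The table conversion described above can be illustrated with a minimal sketch. The listing does not document the service's actual output format, so the input shape used here (each page as a list of rows, each row a list of cell strings, with the first row as the header) is an assumption, and the helper names `merge_pages` and `table_to_records` are hypothetical rather than part of any published API.

```python
import json


def merge_pages(pages):
    """Join table fragments from consecutive pages into one table.

    Assumption: a page that continues a table may repeat the header row,
    in which case the repeated header is dropped.
    """
    merged = []
    for page in pages:
        if merged and page and page[0] == merged[0]:
            page = page[1:]  # skip the repeated header row
        merged.extend(page)
    return merged


def table_to_records(rows):
    """Convert a table (first row = header) into a list of key-value dicts."""
    if not rows:
        return []
    header = [cell.strip() for cell in rows[0]]
    records = []
    for row in rows[1:]:
        # Pad short rows so that every header column receives a value.
        padded = list(row) + [""] * (len(header) - len(row))
        records.append({key: value.strip() for key, value in zip(header, padded)})
    return records


# Hypothetical recognized cells for a table split across two pages.
pages = [
    [["name", "id"], ["Zhang San", "001"]],
    [["name", "id"], ["Li Si", "002"]],
]
records = table_to_records(merge_pages(pages))
print(json.dumps(records, ensure_ascii=False, indent=2))
```

Keeping the page merge separate from the key-value conversion means the same conversion step works for single-page tables, and the resulting list of dicts can be written out as JSON or loaded into a spreadsheet tool.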
Provider:
苏州美能华智能科技有限公司
Created:
2022-11-18
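Collected Summary
Background and Challenges
Background Overview
This OCR recognition package extracts text from many certificate types, such as ID cards and business licenses, as well as from table documents. It handles image noise effectively, reaches a stated recognition accuracy above 99%, and outputs structured data such as key-value pairs and Excel. It is particularly suited to converting unstructured data in complex documents such as tables that span pages.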
The content above was collected and summarized by 遇见数据集.



