[SAMPLE] Nexdata | OCR Data | 500,000 Images| Computer Vision Data| AI & ML Training Data
收藏Databricks2024-05-09 收录
下载链接:
https://marketplace.databricks.com/details/a88493d5-1590-45d0-adfa-b450865b54c9/Nexdata_SAMPLE-Nexdata-OCR-Data-500,000 Images-Computer-Vision-Data-AI-&-ML-Training-Data
下载链接
链接失效反馈官方服务:
资源简介:
1. Specifications
Data size : 500,000 images
Collecting environment : including shop plaque, stop board, poster, ticket, road sign, comic, cover picture, prompt/reminder, warning, packing instruction, menu, building sign, etc.
Diversity : including 20 languages, multiple natural scenes, multiple photographic angles (looking up angle, looking down angle, eye-level angle)
Device : cellphone, camera
Image parameter : the image data format is .jpg, and the annotation file data format is .json
Annotation content : line-level quadrilateral bounding box annotation and transcription for the texts
Accuracy : the error bound of each vertex of quadrilateral bounding box is within 5 pixels, which is a qualified annotation, the accuracy of bounding boxes is not less than 97%; the texts transcription accuracy is not less than 97%
2. About Nexdata
Nexdata owns off-the-shelf 200,000 hours of speech recognition data, 800TB of Annotated Imagery Data, about 2 billion pieces of Natural Language Processing (NLP) Data. These ready-to-go AI & ML Training Data support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/ocrTraining?source=Datarade
提供机构:
Nexdata



