TrainingDataPro/race-numbers-detection-and-ocr

Name: TrainingDataPro/race-numbers-detection-and-ocr
Creator: TrainingDataPro
Published: 2024-04-25 10:16:02
License: cc-by-nc-nd-4.0

Hugging Face, updated 2024-04-25, indexed 2024-03-04

Download link:

https://hf-mirror.com/datasets/TrainingDataPro/race-numbers-detection-and-ocr

Resource description:

---
language:
- en
license: cc-by-nc-nd-4.0
task_categories:
- image-to-text
- object-detection
tags:
- code
- biology
dataset_info:
  features:
  - name: id
    dtype: int32
  - name: name
    dtype: string
  - name: image
    dtype: image
  - name: mask
    dtype: image
  - name: width
    dtype: uint16
  - name: height
    dtype: uint16
  - name: shapes
    sequence:
    - name: label
      dtype:
        class_label:
          names:
            '0': number
    - name: type
      dtype: string
    - name: points
      sequence:
        sequence: float32
    - name: rotation
      dtype: float32
    - name: attributes
      sequence:
      - name: name
        dtype: string
      - name: text
        dtype: string
  splits:
  - name: train
    num_bytes: 106715580
    num_examples: 30
  download_size: 105575371
  dataset_size: 106715580
---

# OCR Race Numbers Object Detection dataset

The dataset consists of photos of runners participating in various races. Each photo captures a runner wearing a race number on their attire. The dataset provides **bounding box** annotations indicating the location of the race number in each photo, together with OCR annotations in which the digit sequences on the race numbers are transcribed.

# 💴 For Commercial Usage: To discuss your requirements, learn about the price and buy the dataset, leave a request on **[TrainingData](https://trainingdata.pro/datasets/racing-bib-number-recognition?utm_source=huggingface&utm_medium=cpc&utm_campaign=race-numbers-detection-and-ocr)**

This dataset combines the domains of sports, computer vision, and OCR technology, providing a resource for work on race number detection and OCR in the context of athletic events.

![](https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F12421376%2Fa63f4fcae18a968f1f07360659f3d15a%2FFrame%2010%20(1).png?generation=1694175985579731&alt=media)

# Dataset structure

- **images** - contains the original images of athletes
- **boxes** - includes bounding box labeling for the original images
- **annotations.xml** - contains the coordinates of the bounding boxes and the indicated text, created for the original photos

# Data Format

Each image from the `images` folder is accompanied by an XML annotation in the `annotations.xml` file indicating the coordinates of the bounding boxes for text detection. For each point, the x and y coordinates are provided.

# Example of XML file structure

![](https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F12421376%2F61251cfa515d37f1fad650419ac22303%2Fcarbon%20(1).png?generation=1694175850461006&alt=media)

# Race Numbers Detection might be made in accordance with your requirements.

# 💴 Buy the Dataset: This is just an example of the data.

Leave a request on **[https://trainingdata.pro/datasets](https://trainingdata.pro/datasets/racing-bib-number-recognition?utm_source=huggingface&utm_medium=cpc&utm_campaign=race-numbers-detection-and-ocr)** to discuss your requirements, learn about the price and buy the dataset.

## **[TrainingData](https://trainingdata.pro/datasets/racing-bib-number-recognition?utm_source=huggingface&utm_medium=cpc&utm_campaign=race-numbers-detection-and-ocr)** provides high-quality data annotation tailored to your needs

More datasets in TrainingData's Kaggle account: **https://www.kaggle.com/trainingdatapro/datasets**

TrainingData's GitHub: **https://github.com/Trainingdata-datamarket/TrainingData_All_datasets**

*keywords: bib number detection, bib detector, rbn, running races, marathons, racing bib number recognition, ocr annotations dataset, text detection, text recognition, optical character recognition, computer vision dataset, image dataset, image-to-text dataset, detecting text-lines, object detection, deep-text-recognition, text area detection, text extraction, images dataset, image-to-text, image classification*

Provider:

TrainingDataPro

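Summary of original information

OCR Race Numbers Object Detection dataset

Dataset overview

The dataset contains photos of runners taking part in various races; each photo shows a runner wearing a race number. It provides bounding box annotations marking the location of the race number in each photo, along with OCR annotations that transcribe the digit sequence on each race number.

Dataset structure

images - contains the original images of athletes
boxes - contains bounding box labeling for the original images
annotations.xml - contains the bounding box coordinates and the indicated text for the original photos

Data format

Each image in the images folder has a matching XML annotation in the annotations.xml file, giving the bounding box coordinates for text detection. The x and y coordinates are provided for each point.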

Dataset information

Language: English
License: cc-by-nc-nd-4.0
Task categories: image-to-text, object-detection
Tags: code, biology

Features

id: int32
name: string
image: image
mask: image
width: uint16
height: uint16
shapes: sequence
- label: class_label, names: number
- type: string
- points: sequence of sequences of float32
- rotation: float32
- attributes: sequence
  - name: string
  - text: string
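
The feature list above implies that each record carries one entry per annotated race number under shapes. The following is a minimal Python sketch of how those entries might be turned into bounding boxes with their transcribed text. It rests on several assumptions that the listing does not confirm: that shapes arrives column-wise (parallel lists, which is how the Hugging Face datasets library usually exposes a sequence of named sub-features), that each entry in points is an [x, y] pair, and that the OCR string lives in the text field of attributes. The helper name shapes_to_boxes and all sample values are hypothetical.

```python
from typing import Any


def shapes_to_boxes(shapes: dict[str, list[Any]]) -> list[dict[str, Any]]:
    """Turn a column-wise `shapes` field into one dict per annotated race number."""
    boxes = []
    for points, attrs in zip(shapes["points"], shapes["attributes"]):
        # Assumption: every point is an [x, y] pair in pixel coordinates.
        xs = [p[0] for p in points]
        ys = [p[1] for p in points]
        # Assumption: `attributes` is also column-wise, with parallel
        # `name` and `text` lists.
        texts = dict(zip(attrs["name"], attrs["text"]))
        boxes.append({
            # Axis-aligned box enclosing all points: (x_min, y_min, x_max, y_max).
            "bbox": (min(xs), min(ys), max(xs), max(ys)),
            "text": texts,
        })
    return boxes


# Made-up record for illustration only; these values are not from the dataset.
sample_shapes = {
    "label": [0],
    "type": ["rectangle"],
    "points": [[[120.0, 340.0], [260.0, 410.0]]],
    "rotation": [0.0],
    "attributes": [{"name": ["text"], "text": ["1542"]}],
}

boxes = shapes_to_boxes(sample_shapes)
print(boxes)
```

Taking the minimum and maximum over all points means the same helper handles a two-corner rectangle and a four-point polygon alike, though it ignores the rotation field; a rotated box would need separate handling.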

Data splits

train: 106715580 bytes, 30 examples

Dataset size

Download size: 105575371 bytes
Dataset size: 106715580 bytes
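
To put these figures in perspective, the short Python sketch below derives the average size per example and the ratio of download size to dataset size. It uses only the three numbers reported above; the variable names are illustrative.

```python
# Figures copied from the split and size information above.
dataset_size = 106715580   # bytes
download_size = 105575371  # bytes
num_examples = 30

avg_bytes = dataset_size / num_examples
avg_mib = avg_bytes / 2**20  # 1 MiB = 2**20 bytes
ratio = download_size / dataset_size

print(f"{avg_bytes:.0f} bytes per example (~{avg_mib:.2f} MiB)")
print(f"download/dataset size ratio: {ratio:.3f}")
```

At roughly 3.4 MiB per example, the 30 records are full-resolution photos, and the near-1 ratio indicates the download is barely compressed relative to the stored data.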
