TrainingDataPro/race-numbers-detection-and-ocr
收藏Hugging Face2024-04-25 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/TrainingDataPro/race-numbers-detection-and-ocr
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- en
license: cc-by-nc-nd-4.0
task_categories:
- image-to-text
- object-detection
tags:
- code
- biology
dataset_info:
features:
- name: id
dtype: int32
- name: name
dtype: string
- name: image
dtype: image
- name: mask
dtype: image
- name: width
dtype: uint16
- name: height
dtype: uint16
- name: shapes
sequence:
- name: label
dtype:
class_label:
names:
'0': number
- name: type
dtype: string
- name: points
sequence:
sequence: float32
- name: rotation
dtype: float32
- name: attributes
sequence:
- name: name
dtype: string
- name: text
dtype: string
splits:
- name: train
num_bytes: 106715580
num_examples: 30
download_size: 105575371
dataset_size: 106715580
---
# OCR Race Numbers Object Detection dataset
The dataset consists of photos of runners, participating in various races. Each photo captures a runner wearing a race number on their attire.
The dataset provides **bounding boxes** annotations indicating the location of the race number in each photo and includes corresponding OCR annotations, where the digit sequences on the race numbers are transcribed.
# 💴 For Commercial Usage: To discuss your requirements, learn about the price and buy the dataset, leave a request on **[TrainingData](https://trainingdata.pro/datasets/racing-bib-number-recognition?utm_source=huggingface&utm_medium=cpc&utm_campaign=race-numbers-detection-and-ocr)** to buy the dataset
This dataset combines the domains of sports, computer vision, and OCR technology, providing a valuable resource for advancing the field of race number detection and OCR in the context of athletic events.
.png?generation=1694175985579731&alt=media)
# Dataset structure
- **images** - contains of original images of athletes
- **boxes** - includes bounding box labeling for the original images
- **annotations.xml** - contains coordinates of the bounding boxes and indicated text, created for the original photo
# Data Format
Each image from `images` folder is accompanied by an XML-annotation in the `annotations.xml` file indicating the coordinates of the bounding boxes for text detection. For each point, the x and y coordinates are provided.
# Example of XML file structure
.png?generation=1694175850461006&alt=media)
# Race Numbers Detection might be made in accordance with your requirements.
# 💴 Buy the Dataset: This is just an example of the data. Leave a request on **[https://trainingdata.pro/datasets](https://trainingdata.pro/datasets/racing-bib-number-recognition?utm_source=huggingface&utm_medium=cpc&utm_campaign=race-numbers-detection-and-ocr)** to discuss your requirements, learn about the price and buy the dataset
## **[TrainingData](https://trainingdata.pro/datasets/racing-bib-number-recognition?utm_source=huggingface&utm_medium=cpc&utm_campaign=race-numbers-detection-and-ocr)** provides high-quality data annotation tailored to your needs
More datasets in TrainingData's Kaggle account: **https://www.kaggle.com/trainingdatapro/datasets**
TrainingData's GitHub: **https://github.com/Trainingdata-datamarket/TrainingData_All_datasets**
*keywords: bib number detection, bib detector, rbn, running races, marathons, racing bib number recognition, ocr annotations dataset, text detection, text recognition, optical character recognition, computer vision dataset, image dataset, image-to-text dataset, detecting text-lines, object detection, deep-text-recognition, text area detection, text extraction, images dataset, image-to-text, image classification*
提供机构:
TrainingDataPro
原始信息汇总
OCR Race Numbers Object Detection dataset
数据集概述
该数据集包含参与各种比赛的跑步者的照片,每张照片捕捉到穿着比赛号码的跑步者。数据集提供边界框标注,指示每张照片中比赛号码的位置,并包含相应的OCR标注,其中比赛号码上的数字序列被转录。
数据集结构
- images - 包含运动员的原始图像
- boxes - 包含原始图像的边界框标注
- annotations.xml - 包含原始照片的边界框坐标和指示文本
数据格式
images文件夹中的每张图像都伴随一个annotations.xml文件,指示文本检测的边界框坐标。每个点的x和y坐标都被提供。
数据集信息
- 语言: 英语
- 许可证: cc-by-nc-nd-4.0
- 任务类别: 图像到文本, 目标检测
- 标签: code, biology
特征
- id: 类型为int32
- name: 类型为string
- image: 类型为image
- mask: 类型为image
- width: 类型为uint16
- height: 类型为uint16
- shapes: 序列类型
- label: 类型为class_label, 名称: number
- type: 类型为string
- points: 序列类型, 类型为float32
- rotation: 类型为float32
- attributes: 序列类型
- name: 类型为string
- text: 类型为string
数据分割
- train: 字节数为106715580, 样本数为30
数据集大小
- 下载大小: 105575371
- 数据集大小: 106715580



