five

TrainingDataPro/race-numbers-detection-and-ocr

收藏
Hugging Face2024-04-25 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/TrainingDataPro/race-numbers-detection-and-ocr
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - en license: cc-by-nc-nd-4.0 task_categories: - image-to-text - object-detection tags: - code - biology dataset_info: features: - name: id dtype: int32 - name: name dtype: string - name: image dtype: image - name: mask dtype: image - name: width dtype: uint16 - name: height dtype: uint16 - name: shapes sequence: - name: label dtype: class_label: names: '0': number - name: type dtype: string - name: points sequence: sequence: float32 - name: rotation dtype: float32 - name: attributes sequence: - name: name dtype: string - name: text dtype: string splits: - name: train num_bytes: 106715580 num_examples: 30 download_size: 105575371 dataset_size: 106715580 --- # OCR Race Numbers Object Detection dataset The dataset consists of photos of runners, participating in various races. Each photo captures a runner wearing a race number on their attire. The dataset provides **bounding boxes** annotations indicating the location of the race number in each photo and includes corresponding OCR annotations, where the digit sequences on the race numbers are transcribed. # 💴 For Commercial Usage: To discuss your requirements, learn about the price and buy the dataset, leave a request on **[TrainingData](https://trainingdata.pro/datasets/racing-bib-number-recognition?utm_source=huggingface&utm_medium=cpc&utm_campaign=race-numbers-detection-and-ocr)** to buy the dataset This dataset combines the domains of sports, computer vision, and OCR technology, providing a valuable resource for advancing the field of race number detection and OCR in the context of athletic events. ![](https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F12421376%2Fa63f4fcae18a968f1f07360659f3d15a%2FFrame%2010%20(1).png?generation=1694175985579731&alt=media) # Dataset structure - **images** - contains of original images of athletes - **boxes** - includes bounding box labeling for the original images - **annotations.xml** - contains coordinates of the bounding boxes and indicated text, created for the original photo # Data Format Each image from `images` folder is accompanied by an XML-annotation in the `annotations.xml` file indicating the coordinates of the bounding boxes for text detection. For each point, the x and y coordinates are provided. # Example of XML file structure ![](https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F12421376%2F61251cfa515d37f1fad650419ac22303%2Fcarbon%20(1).png?generation=1694175850461006&alt=media) # Race Numbers Detection might be made in accordance with your requirements. # 💴 Buy the Dataset: This is just an example of the data. Leave a request on **[https://trainingdata.pro/datasets](https://trainingdata.pro/datasets/racing-bib-number-recognition?utm_source=huggingface&utm_medium=cpc&utm_campaign=race-numbers-detection-and-ocr)** to discuss your requirements, learn about the price and buy the dataset ## **[TrainingData](https://trainingdata.pro/datasets/racing-bib-number-recognition?utm_source=huggingface&utm_medium=cpc&utm_campaign=race-numbers-detection-and-ocr)** provides high-quality data annotation tailored to your needs More datasets in TrainingData's Kaggle account: **https://www.kaggle.com/trainingdatapro/datasets** TrainingData's GitHub: **https://github.com/Trainingdata-datamarket/TrainingData_All_datasets** *keywords: bib number detection, bib detector, rbn, running races, marathons, racing bib number recognition, ocr annotations dataset, text detection, text recognition, optical character recognition, computer vision dataset, image dataset, image-to-text dataset, detecting text-lines, object detection, deep-text-recognition, text area detection, text extraction, images dataset, image-to-text, image classification*
提供机构:
TrainingDataPro
原始信息汇总

OCR Race Numbers Object Detection dataset

数据集概述

该数据集包含参与各种比赛的跑步者的照片,每张照片捕捉到穿着比赛号码的跑步者。数据集提供边界框标注,指示每张照片中比赛号码的位置,并包含相应的OCR标注,其中比赛号码上的数字序列被转录。

数据集结构

  • images - 包含运动员的原始图像
  • boxes - 包含原始图像的边界框标注
  • annotations.xml - 包含原始照片的边界框坐标和指示文本

数据格式

images文件夹中的每张图像都伴随一个annotations.xml文件,指示文本检测的边界框坐标。每个点的x和y坐标都被提供。

数据集信息

  • 语言: 英语
  • 许可证: cc-by-nc-nd-4.0
  • 任务类别: 图像到文本, 目标检测
  • 标签: code, biology

特征

  • id: 类型为int32
  • name: 类型为string
  • image: 类型为image
  • mask: 类型为image
  • width: 类型为uint16
  • height: 类型为uint16
  • shapes: 序列类型
    • label: 类型为class_label, 名称: number
    • type: 类型为string
    • points: 序列类型, 类型为float32
    • rotation: 类型为float32
    • attributes: 序列类型
      • name: 类型为string
      • text: 类型为string

数据分割

  • train: 字节数为106715580, 样本数为30

数据集大小

  • 下载大小: 105575371
  • 数据集大小: 106715580
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作