Natural Scene Braille Character Recognition Dataset
Source: DataCite Commons | Updated: 2025-04-27 | Indexed: 2025-04-16
Download link:
https://www.scidb.cn/detail?dataSetId=b1df1a601acc47a6984aafa8f3ab8e92
Description:
This dataset contains 1157 Braille segment images in total: 925 in the training set and 232 in the testing set. The dataset directory holds two folders, character_label and segment_label. The character_label folder stores the Braille segment images in three annotation formats: (1) ICDAR-2015 format, in which each .jpg file corresponds to a .txt file, and each line of the .txt file stores the position and recognition label of one Braille character rectangle, given as the coordinates of the rectangle's four corner points and the numerical label; (2) the original annotation format, stored in the org folder, in which each .jpg file corresponds to a .json file produced with the labelme software; (3) VOC format, stored in the voc-data folder, which holds the images and the corresponding .xml files, each marking the position of every Braille character rectangle and its numerical label. In addition, the original natural-scene Braille images and the corresponding Braille segment annotation .json files are stored in the segment_label folder.
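The ICDAR-2015 label files described above could be read along the following lines. This is a minimal sketch, not the dataset authors' tooling: it assumes the usual ICDAR-2015 convention of comma-separated fields (eight corner coordinates, then the label), and the names BrailleBox, parse_icdar_line, and parse_icdar_text, as well as the sample values, are hypothetical.

```python
from typing import List, NamedTuple, Tuple


class BrailleBox(NamedTuple):
    points: List[Tuple[int, int]]  # four (x, y) corner points
    label: str                     # numerical label, kept as text


def parse_icdar_line(line: str) -> BrailleBox:
    """Parse one label line, ASSUMING 'x1,y1,x2,y2,x3,y3,x4,y4,label'."""
    # Drop surrounding whitespace and a possible UTF-8 byte-order mark.
    fields = line.strip().lstrip("\ufeff").split(",", 8)
    if len(fields) < 9:
        raise ValueError(f"expected 9 comma-separated fields, got {len(fields)}")
    coords = [int(float(value)) for value in fields[:8]]
    # Pair up x and y values into four corner points.
    points = list(zip(coords[0::2], coords[1::2]))
    return BrailleBox(points, fields[8].strip())


def parse_icdar_text(text: str) -> List[BrailleBox]:
    """Parse the full contents of one .txt label file, skipping blank lines."""
    return [parse_icdar_line(line) for line in text.splitlines() if line.strip()]


# Made-up sample content; real files live next to the .jpg images.
sample = "10,20,40,20,40,60,10,60,37\n50,20,80,20,80,60,50,60,5\n"
boxes = parse_icdar_text(sample)
for box in boxes:
    print(box.label, box.points)
```

If the actual files use a different separator or field order, only parse_icdar_line needs to change.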
Provider:
Science Data Bank
Created:
2024-07-18
Collected summary
Dataset introduction

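Background and challenges
Background overview
This is an image dataset for Braille character recognition in natural scenes. It contains 1157 Braille segment images, split into a training set and a testing set. It provides three annotation formats (ICDAR-2015, original JSON, and VOC), supports computer vision tasks such as object detection and character recognition, and includes natural-scene images to improve practical applicability.
The summary above was collected and generated by 遇见数据集.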
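For object detection work, the VOC-format .xml annotations could be loaded roughly as follows. This is a sketch under assumptions: it presumes the standard Pascal VOC layout (object, name, and bndbox elements with xmin, ymin, xmax, ymax), which the dataset description does not spell out, and the function name parse_voc and the sample XML are hypothetical.

```python
import xml.etree.ElementTree as ET

# Made-up annotation in the standard Pascal VOC layout (an assumption).
SAMPLE_XML = """<annotation>
  <filename>example.jpg</filename>
  <object>
    <name>37</name>
    <bndbox><xmin>10</xmin><ymin>20</ymin><xmax>40</xmax><ymax>60</ymax></bndbox>
  </object>
  <object>
    <name>5</name>
    <bndbox><xmin>50</xmin><ymin>20</ymin><xmax>80</xmax><ymax>60</ymax></bndbox>
  </object>
</annotation>"""


def parse_voc(xml_text):
    """Return a list of {'label', 'bbox'} dicts, one per annotated character."""
    root = ET.fromstring(xml_text)
    boxes = []
    for obj in root.iter("object"):
        bnd = obj.find("bndbox")
        if bnd is None:
            continue  # skip objects without a bounding box
        # Read the four box edges; a missing tag raises TypeError here.
        bbox = tuple(
            int(float(bnd.findtext(tag)))
            for tag in ("xmin", "ymin", "xmax", "ymax")
        )
        label = obj.findtext("name", default="").strip()
        boxes.append({"label": label, "bbox": bbox})
    return boxes


voc_boxes = parse_voc(SAMPLE_XML)
for item in voc_boxes:
    print(item["label"], item["bbox"])
```

For files on disk, ET.parse(path).getroot() can replace ET.fromstring, with the rest of the function unchanged.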



