rupindersingh1313/demo_synth_a4_size_document_data
Source: Hugging Face. Updated: 2025-10-14. Indexed: 2025-10-25.
Download link:
https://hf-mirror.com/datasets/rupindersingh1313/demo_synth_a4_size_document_data
Description:
The dataset contains images and their corresponding annotations. Each annotation records the document's image dimensions and language, along with text-line and word information: for each text line, its polygon coordinates, reading order, and text content; for each word, its phonetic (syllable) units, polygon coordinates, reading order, text, and word ID. The dataset is split into training, validation, and test sets and is suited to tasks such as text recognition in natural language processing.
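The annotation layout described above can be pictured as a small nested record. The following is a minimal sketch only: every class and field name here (`DocumentAnnotation`, `TextLine`, `Word`, `order`, `units`, and so on) is hypothetical and chosen for illustration, since the listing does not give the dataset's actual column names. The sample values are invented as well.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

Point = Tuple[float, float]  # (x, y) in image pixel coordinates


@dataclass
class Word:
    # All field names are assumptions, not the dataset's real schema.
    word_id: str
    text: str
    polygon: List[Point]
    order: int                      # reading order within the line
    units: List[str] = field(default_factory=list)  # phonetic/syllable units


@dataclass
class TextLine:
    text: str
    polygon: List[Point]
    order: int                      # reading order within the page
    words: List[Word] = field(default_factory=list)


@dataclass
class DocumentAnnotation:
    width: int
    height: int
    language: str
    lines: List[TextLine] = field(default_factory=list)


def text_in_reading_order(doc: DocumentAnnotation) -> str:
    """Join line texts after sorting lines by their reading order."""
    ordered = sorted(doc.lines, key=lambda line: line.order)
    return "\n".join(line.text for line in ordered)


def bounding_box(polygon: List[Point]) -> Tuple[float, float, float, float]:
    """Axis-aligned box (min_x, min_y, max_x, max_y) enclosing a polygon."""
    xs = [x for x, _ in polygon]
    ys = [y for _, y in polygon]
    return min(xs), min(ys), max(xs), max(ys)


# Invented sample record; 2480 x 3508 px corresponds to A4 at 300 DPI.
doc = DocumentAnnotation(
    width=2480,
    height=3508,
    language="en",
    lines=[
        TextLine("second line", [(10, 60), (110, 60), (110, 90), (10, 90)], order=1),
        TextLine("first line", [(10, 20), (110, 20), (110, 50), (10, 50)], order=0),
    ],
)

print(text_in_reading_order(doc))
print(bounding_box(doc.lines[1].polygon))
```

Sorting by an explicit reading-order field, rather than by polygon position, mirrors the description's statement that reading order is stored in the annotation itself. Check the real column names on the dataset page before adapting this.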
Provider:
rupindersingh1313



