TextOCR 文本识别数据集

超神经2022-10-26 更新2024-05-15 收录

下载链接：

https://hyper.ai/cn/datasets/20131

下载链接

链接失效反馈

官方服务：

资源简介：

OCR 全称 optical character recognition，TextOCR 是用于对任意场景文本进行检测和识别的数据集。 TextOCR 为 TextVQA 中的图像提供了约 100 万个高质量的词汇标注，并且能在视觉问答或图像说明等下游任务上实行端到端的推理。

OCR stands for Optical Character Recognition. TextOCR is a dataset dedicated to detection and recognition of arbitrary scene text. It provides approximately 1 million high-quality word-level annotations for the images in TextVQA, and enables end-to-end inference for downstream tasks such as visual question answering (VQA) and image captioning.

创建时间：

2022-10-26

搜集汇总

数据集介绍

背景与挑战

背景概述

TextOCR是一个用于场景文本检测与识别的数据集，包含来自TextVQA的28,134张图像和约90.3万个高质量文本标注，平均每图32个词。该数据集支持视觉问答等下游任务的端到端推理，遵循CC BY 4.0许可。

以上内容由遇见数据集搜集并总结生成