IndoAksaraOCR/OCRData-Validated
Source: Hugging Face. Updated: 2025-02-14. Indexed: 2025-02-15.
Download link:
https://hf-mirror.com/datasets/IndoAksaraOCR/OCRData-Validated
Resource summary:
The dataset consists of nine configurations: Image Segmentation, Image Transcription (OCR), Image Translation, Image Transliteration, Transcription LID (language identification), Transcription Translation, Transcription Transliteration, Transliteration LID, and Transliteration Translation. Each configuration has its own feature fields, such as image ID, image URL, height, width, language, segmentation information, transcription, translation, transliteration, and language label. Each configuration is provided as a training split, and the file path to each configuration's training files is given.
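Because the summary describes several configurations that each expose a training split, a short loading sketch may help. It assumes the Hugging Face `datasets` library is installed and that the repository ID matches the title of this page. The exact configuration names are not listed here, so the sketch queries them instead of guessing; the helper names `list_configs` and `load_config` are hypothetical.

```python
# Minimal sketch, assuming the third-party `datasets` library is installed
# (pip install datasets) and network access to the Hugging Face Hub.

# Repository ID taken from the page title; assumed to be the Hub ID.
REPO_ID = "IndoAksaraOCR/OCRData-Validated"


def list_configs():
    """Return the configuration names published for the repository."""
    # Imported lazily so this module can be loaded without the library.
    from datasets import get_dataset_config_names

    return get_dataset_config_names(REPO_ID)


def load_config(config_name, split="train"):
    """Load one configuration; the summary mentions only a training split."""
    from datasets import load_dataset

    return load_dataset(REPO_ID, name=config_name, split=split)
```

A typical session would call `list_configs()` first, pick one of the returned names, and pass it to `load_config(name)`. Inspecting the `features` attribute of the returned dataset then shows which of the fields mentioned above that configuration actually carries.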
Provided by:
IndoAksaraOCR



