edyhvh/hutter
收藏Hugging Face2025-12-17 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/edyhvh/hutter
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含希伯来语圣经文本的渲染图像,特别是Elias Hutter的Besorah(新约)中的书籍。数据集分为两个主要子集:希伯来语(处理/渲染的希伯来语文本图像,共1,243张图像)和原始(原始文本图像,共2,541张图像)。每个书籍都有自己的目录,图像按顺序编号(例如,000001.png,000002.png)。数据集支持的任务包括光学字符识别(OCR)、图像到文本转换、圣经文本处理、文档理解和多语言OCR研究。
This dataset contains rendered images of biblical texts in Hebrew, specifically covering books from the Elias Hutters Besorah (New Testament). The dataset is organized into two main subsets: Hebrew (processed/rendered Hebrew text images, 1,243 images) and Raw (original raw text images, 2,541 images). Each book is organized in its own directory, with images sequentially numbered (e.g., 000001.png, 000002.png). The dataset is designed for Optical Character Recognition (OCR), Image-to-Text conversion, Biblical Text Processing, Document Understanding, and Multilingual OCR research.
提供机构:
edyhvh



