LIACC/PORTO
Source: Hugging Face | Updated: 2025-06-13 | Indexed: 2025-10-18
Download link:
https://hf-mirror.com/datasets/LIACC/PORTO
Description:
The PORTO dataset is a resource for evaluating and developing OCR and post-OCR systems for historical Portuguese documents. Its fields include text corpus, book name, date, filename, and transcription text, among others, and it is suited to image-to-text, fill-mask, and text-generation tasks.
Provider:
LIACC



