Post-OCR correction training dataset sPeriodika-postOCR
收藏B2FIND2026-04-29 收录
下载链接:
https://b2find.eudat.eu/dataset/b9062910-c8b6-55ca-9c2e-f0957867a1bb
下载链接
链接失效反馈官方服务:
资源简介:
The post-OCR correction dataset consists of paragraphs of text, at least 100 characters in length, extracted from documents randomly sampled from the sPeriodika dataset...



