Drinov Orthography for Post-OCR Correction dataset
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/13715711
下载链接
链接失效反馈官方服务:
资源简介:
The Drinov Orthography for Post-OCR Correction (DOPOC) dataset was created by annotating a historical newspaper collection provided by the National Library "Ivan Vazov" (NLIV) in Plovdiv, Bulgaria. We consider printed versions of these documents, which we manually annotate and align at the character level in the same format as the one from the ICDAR 2019 post-OCR correction competition.
创建时间:
2024-09-07



