Drinov Orthography for Post-OCR Correction dataset

Source: NIAID Data Ecosystem (indexed 2026-05-02)

Download link:

https://zenodo.org/record/13715711


Description:

The Drinov Orthography for Post-OCR Correction (DOPOC) dataset was created by annotating a historical newspaper collection provided by the National Library "Ivan Vazov" (NLIV) in Plovdiv, Bulgaria. It covers the printed versions of these documents, which were manually annotated and aligned at the character level in the same format as the data from the ICDAR 2019 post-OCR correction competition.
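
The character-level alignment mentioned in the description can be illustrated with a small sketch. It assumes the three-line record layout generally associated with the ICDAR 2019 competition files (`[OCR_toInput]`, `[OCR_aligned]`, `[ GS_aligned]`, with `@` as the padding symbol); whether the DOPOC files follow exactly this layout should be checked against the Zenodo record. The sample text and the helper names `parse_record` and `mismatches` are hypothetical and are not taken from the dataset.

```python
# Assumption: "@" pads the shorter string so that OCR output and
# gold standard (GS) have equal length and line up position by position.
PAD = "@"


def parse_record(text):
    """Return (ocr_aligned, gs_aligned) from an assumed three-line record."""
    fields = {}
    for line in text.splitlines():
        if line.startswith("[") and "]" in line:
            tag, _, body = line.partition("]")
            # Tags may carry inner padding, e.g. "[ GS_aligned]".
            key = tag.strip("[ ")
            # Drop the single separator space after the closing bracket.
            fields[key] = body[1:] if body.startswith(" ") else body
    return fields["OCR_aligned"], fields["GS_aligned"]


def mismatches(ocr, gs):
    """List (index, ocr_char, gs_char) for every position that differs."""
    if len(ocr) != len(gs):
        raise ValueError("aligned strings must have equal length")
    return [(i, o, g) for i, (o, g) in enumerate(zip(ocr, gs)) if o != g]


# Invented sample: one missing letter and one confused final letter.
sample = (
    "[OCR_toInput] слво вѣстникь\n"
    "[OCR_aligned] сл@во вѣстникь\n"
    "[ GS_aligned] слово вѣстникъ\n"
)

ocr, gs = parse_record(sample)
diffs = mismatches(ocr, gs)
for index, ocr_char, gs_char in diffs:
    kind = "missing in OCR" if ocr_char == PAD else "substitution"
    print(index, repr(ocr_char), "->", repr(gs_char), kind)
```

Because both strings have the same length after padding, each OCR error maps to a single position, which is what makes a character-aligned format convenient for training and evaluating correction models.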

Created:

2024-09-07
