five

ScriptNet: ICDAR 2017 Competition on Baseline Detection in Archival Documents (cBAD)

收藏
NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://zenodo.org/record/746925
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains the training and test set for the ICDAR 2017 Competition on Baseline Detection in Archival Documents (cBAD). A newly created freely available real world dataset consisting of 2035 annotated document page images that are collected from 9 different archives and form the basis of cBAD. Two competition tracks test different characteristics of the methods submitted. Track A [Simple Documents] is published with annotated text regions and tests therefore a method's quality of text line segmentation. The more challenging Track B [Complex Documents] provides only the page area. Hence, baseline detection algorithms need to correctly locate text lines in the presence of marginalia, tables, and noise. The dataset comprises images with additional PAGE XMLs. The PAGE XMLs contain text regions and baseline annotations. Competition Website: https://scriptnet.iit.demokritos.gr/competitions/5/ Version 3 is the version of the cBad competition Version 4 contains also the page region and in case of a double-page the page split as separator.
创建时间:
2020-01-24
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作