CREMMA Medieval - abbr and expan altos
Source: NIAID Data Ecosystem (indexed 2026-03-14)
Download link:
https://zenodo.org/record/7506656
Description:
This dataset is derived from the CREMMA-Medieval dataset (https://github.com/HTR-United/cremma-medieval). It was modified to provide both the abbreviated and the expanded forms of the text in separate ALTO files, for HTR training experiments.
The original data set was created with the support of the DIM MAP in the context of the CREMMA project (https://www.dim-map.fr/projets-soutenus/cremma/).
This version was prepared in March-June 2022 as part of the research for the following paper:
Camps, Jean-Baptiste, Chahan Vidal-Gorène, Dominique Stutzmann, Marguerite Vernet, and Ariane Pinche. « Data Diversity in Handwritten Text Recognition: Challenge or Opportunity? » In Digital Humanities 2022. Conference Abstracts (The University of Tokyo, Japan, 25-29 July 2022), published by DH2022 Local Organizing Committee, 160‑65. Tokyo, 2022. https://dh2022.dhii.asia/dh2022bookofabsts.pdf#page=162 and https://dh2022.dhii.asia/abstracts/files/CAMPS_Jean_Baptiste_Data_Diversity_in_handwritten_text_recog.html.
If you use this dataset, please cite:
@incollection{dh2022_local_organizing_committee_data_2022,
address = {Tokyo},
title = {Data {Diversity} in handwritten text recognition: challenge or opportunity?},
url = {https://dh2022.dhii.asia/dh2022bookofabsts.pdf},
language = {en},
urldate = {2022-08-02},
booktitle = {Digital {Humanities} 2022. {Conference} {Abstracts} ({The} {University} of {Tokyo}, {Japan}, 25-29 {July} 2022)},
author = {Camps, Jean-Baptiste and Vidal-Gorène, Chahan and Stutzmann, Dominique and Vernet, Marguerite and Pinche, Ariane},
editor = {{DH2022 Local Organizing Committee}},
year = {2022},
pages = {160--165},
}
Folders
img
Folder with the scans of the base documents.
alto
ALTO files, in abbreviated and expanded text form, with or without normalisations. For more details, please see the aforementioned paper.
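
As a rough illustration of how the transcriptions in the alto folder might be read for HTR experiments, here is a minimal Python sketch that collects the text of each line from an ALTO document. It assumes the standard ALTO layout, in which each TextLine element holds String elements with a CONTENT attribute. The helper names and the sample text are invented for the example and do not come from the dataset. Tags are matched by local name, so the ALTO namespace version does not need to be known in advance.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix so any ALTO version is handled."""
    return tag.rsplit("}", 1)[-1]


def alto_lines(xml_text):
    """Return one string per TextLine, joining its String CONTENT values."""
    root = ET.fromstring(xml_text)
    lines = []
    for elem in root.iter():
        if local_name(elem.tag) != "TextLine":
            continue
        words = [
            child.get("CONTENT", "")
            for child in elem
            if local_name(child.tag) == "String"
        ]
        lines.append(" ".join(words))
    return lines


# Invented sample, NOT taken from the dataset: two lines in ALTO v4 style.
SAMPLE = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v4#">
  <Layout><Page><PrintSpace><TextBlock>
    <TextLine><String CONTENT="exemple de ligne"/></TextLine>
    <TextLine><String CONTENT="seconde"/><SP/><String CONTENT="ligne"/></TextLine>
  </TextBlock></PrintSpace></Page></Layout>
</alto>"""

if __name__ == "__main__":
    for line in alto_lines(SAMPLE):
        print(line)
```

To compare the abbreviated and expanded versions of a page, the same function could be applied to both ALTO files and the resulting lists zipped together, assuming both files segment the page into the same lines.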
Created:
2023-01-06



