five

CREMMA Medieval - abbr and expan altos

收藏
NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/7506656
下载链接
链接失效反馈
官方服务:
资源简介:
This data set is derived from the CREMMA-Medieval dataset (https://github.com/HTR-United/cremma-medieval). It was modified to include both abbreviated and expanded forms in separated ALTO, for HTR training experiments. The original data set was created with the support of the DIM MAP in the context of the CREMMA project (https://www.dim-map.fr/projets-soutenus/cremma/). This version was prepared in March-June 2022 as part of the research for the following paper: Camps, Jean-Baptiste, Chahan Vidal-Gorène, Dominique Stutzmann, Marguerite Vernet, and Ariane Pinche. « Data Diversity in Handwritten Text Recognition: Challenge or Opportunity? » In Digital Humanities 2022. Conference Abstracts (The University of Tokyo, Japan, 25-29 July 2022), published by DH2022 Local Organizing Committee, 160‑65. Tokyo, 2022. https://dh2022.dhii.asia/dh2022bookofabsts.pdf#page=162  and  https://dh2022.dhii.asia/abstracts/files/CAMPS_Jean_Baptiste_Data_Diversity_in_handwritten_text_recog.html. If you use this dataset, please quote: @incollection{dh2022_local_organizing_committee_data_2022, address = {Tokyo}, title = {Data {Diversity} in handwritten text recognition: challenge or opportunity?}, url = {https://dh2022.dhii.asia/dh2022bookofabsts.pdf}, language = {en}, urldate = {2022-08-02}, booktitle = {Digital {Humanities} 2022. {Conference} {Abstracts} ({The} {University} of {Tokyo}, {Japan}, 25-29 {July} 2022)}, author = {Camps, Jean-Baptiste and Vidal-Gorène, Chahan and Stutzmann, Dominique and Vernet, Marguerite and Pinche, Ariane}, editor = {{DH2022 Local Organizing Committee}}, year = {2022}, pages = {160--165}, } Folders img Folder with the scans of the base documents. alto ALTO files, in abbreviated and expanded text form, with or without normalisations. For more details, please see the aforementioned paper.
创建时间:
2023-01-06
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作