
HTR Winter School 2023/2024 - Late Medieval Latin, ONB 3891

Indexed by NIAID Data Ecosystem, 2026-05-01
Download link:
https://zenodo.org/record/10589478
Resource description:
Short description of the record
Ground truth for the ÖNB 3891 manuscript, prepared in Transkribus during the Winter School of Handwritten Text Recognition of Medieval Manuscripts, 2023/2024, in Vienna.

Description
Sermones by Thomas Ebendorfer (1388-1464) as found in MS Vienna, Austrian National Library (ÖNB), Cod. 3891. Wolfgang Chranekker, an organist in St. Wolfgang, finished the writing in 1441. See the description of the manuscript at [Manuscripta.at](https://manuscripta.at/hs_detail.php?ID=6137).
Writing: Latin, Bastarda, mid 15th C.

Origin of the data
Source of images: Austrian National Library

Description or citation of transcription guidelines
- expanded abbreviations
- preserved the original punctuation (interpunction)
- used "/" for virgula
- did not add "." at the end of sentences
- used "¬" at the end of the line if a word is divided
- used "v" for the consonant and "u" for the vowel
- used "i" for i/j
- used "s" for ſ/s
- used c/t as in the manuscript
- no capitalization of letters
- preserved "ll" in the place of "L", "ff" in the place of "F", etc.
- separated prepositions from words
- wrote words together that ought to be written together
- preserved numbers
See Google docs

On encoding
Pages are marked as "ground truth" if they were checked more than once and as "final" if they were checked only once. Unclear passages are marked as "unclear" in Transkribus.

How to cite
This dataset was created by Cehuľová Viktória, Ciuntu Mara-Elena, Engelmaier Leonhard, Kohn Albert, Lukáč Kováčová Magdaléna, Lukáč Labancová Ivana, Mihaljević Ana, Odstrčilík Jan, Roček Martin, Rokpelne Liene, Scalia Andrea, Šaldová Zuzana, Vašíček Andrej, Yücel Fatih, Zelenková Adéla. The digitisation is not copyright free, but the transcription is. However, properly annotating a corpus takes time and is a task that should be recognised. If you use any item from this corpus as ground truth, cite the dataset using the following information

Copyright and licence
This dataset was created as part of the Winter School of Handwritten Text Recognition of Medieval Manuscripts 2023/2024, Vienna, at the Österreichische Akademie der Wissenschaften, Institut für Mittelalterforschung. All transcriptions are licensed under the Creative Commons 4 licence. Images were provided by the Austrian National Library (ÖNB).
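Anyone comparing another transcription against this ground truth may want to bring it in line with the transcription guidelines listed in the description. The following is a minimal sketch of how a few of the mechanical conventions (s for ſ, i for j, no capitalization, "¬" for divided words) could be applied in Python. The function names and the sample lines are hypothetical and not part of the dataset. The u/v rule and the expansion of abbreviations depend on reading the text and are deliberately left out.

```python
# Illustrative helpers only; names and sample text are assumptions,
# not tooling shipped with the dataset.

# Character-level conventions from the guidelines: "s" for ſ, "i" for j.
CHAR_MAP = str.maketrans({"ſ": "s", "j": "i", "J": "i"})


def normalise_line(line: str) -> str:
    """Map ſ to s and j to i, then lowercase (no capitalization)."""
    return line.translate(CHAR_MAP).lower()


def join_divided_words(lines: list[str]) -> str:
    """Rejoin words divided across lines, marked by a trailing '¬'."""
    parts = []
    for line in lines:
        line = line.rstrip()
        if line.endswith("¬"):
            # Drop the marker and add no space: the word continues
            # at the start of the next line.
            parts.append(line[:-1])
        else:
            parts.append(line + " ")
    return "".join(parts).strip()


if __name__ == "__main__":
    sample = ["Juſtus predica¬", "tio eſt"]  # invented sample lines
    print(join_divided_words([normalise_line(s) for s in sample]))
```

Running the line-level normalisation before joining keeps the "¬" marker intact until the lines are merged, so the two steps can also be used separately.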
Created:
2024-01-30