List of manuscripts containing John Chrysostom's Homilies and the relevant manual transcriptions
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/7681132
下载链接
链接失效反馈官方服务:
资源简介:
This dataset consists of a list of all manuscripts (in the form of a .csv file) used as data in experiments with HTR training via Transkribus. The manuscripts are dated between the 10th-14th centuries and transmit John Chrysostom’s Homilies on St. Paul’s Epistles to Titus. Homilies 1 and 5 were exploited for the training process. In addition, 19 XML source files are provided in the TEI standards format, which contains a sample of the manual transcription used as ground truth data for training HTR models.
Specifically, the sample_dataset_chrysostomus_ad-titum.csv file includes the following columns:
Sigla: a capital letter used in critical editions to refer to a specific manuscript in an abbreviated form.
Manuscripts: the name of each manuscript, containing the library and the catalogue number assigned to it.
Folia: the folia (i.e., pages) of each manuscript used in the experiments. A different sequence of folia from the same manuscript is recorded in a separate row of this file.
Ground truth data sample [file_name]: the file name of the TEI/XML files that correspond to each manuscript.
Image files: most digital reproductions of manuscripts are under some degree of copyright protection. So, instead of the image files, in this column, one can find a link to the relevant library's digital archive (if applicable).
创建时间:
2023-06-30



