
List of manuscripts containing John Chrysostom's Homilies and the relevant manual transcriptions

Source: NIAID Data Ecosystem (indexed 2026-05-01)
Download link:
https://zenodo.org/record/7681132
Description:
This dataset consists of a list, in the form of a .csv file, of all manuscripts used as data in experiments with HTR (handwritten text recognition) training via Transkribus. The manuscripts date from the 10th to the 14th century and transmit John Chrysostom’s Homilies on St. Paul’s Epistle to Titus. Homilies 1 and 5 were used for the training process. In addition, 19 XML source files are provided in TEI format; they contain a sample of the manual transcriptions used as ground truth data for training HTR models.

Specifically, the sample_dataset_chrysostomus_ad-titum.csv file includes the following columns:
Sigla: a capital letter used in critical editions to refer to a specific manuscript in abbreviated form.
Manuscripts: the name of each manuscript, consisting of the holding library and the catalogue number assigned to it.
Folia: the folia (i.e., pages) of each manuscript used in the experiments. Each distinct sequence of folia from the same manuscript is recorded in a separate row.
Ground truth data sample [file_name]: the file name of the TEI/XML file that corresponds to each manuscript.
Image files: most digital reproductions of manuscripts are under some degree of copyright protection, so instead of the image files this column gives a link to the relevant library's digital archive (if applicable).
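Because one manuscript can occupy several rows (one per sequence of folia), it may help to group the rows by siglum when reading the file. The following is a minimal sketch only: it assumes the CSV headers match the column names given in the description ("Sigla", "Folia"), which has not been checked against the actual file, and the sample rows are hypothetical placeholders rather than data from the dataset.

```python
import csv
import io
from collections import defaultdict

# Header names as given in the description; the exact header strings in the
# real CSV are an assumption and may need adjusting.
SIGLA_COL = "Sigla"
FOLIA_COL = "Folia"


def folia_by_siglum(csv_file):
    """Group folia sequences by siglum; one manuscript may span several rows."""
    groups = defaultdict(list)
    for row in csv.DictReader(csv_file):
        groups[row[SIGLA_COL].strip()].append(row[FOLIA_COL].strip())
    return dict(groups)


# Hypothetical placeholder rows, NOT taken from the dataset.
sample = io.StringIO(
    "Sigla,Manuscripts,Folia\n"
    "X,Example Library MS 1,1r-5v\n"
    "X,Example Library MS 1,20r-24v\n"
    "Y,Example Library MS 2,3r-9v\n"
)
result = folia_by_siglum(sample)
print(result)
```

With the real file, the same function could be called on `open("sample_dataset_chrysostomus_ad-titum.csv", newline="", encoding="utf-8")`; the UTF-8 encoding is likewise an assumption.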
Created:
2023-06-30