five

The DLA-RMR dataset: Annotated subset of RMR notebooks for CVC development

收藏
DataCite Commons2025-11-12 更新2026-05-07 收录
下载链接:
https://www.fdr.uni-hamburg.de/record/18089
下载链接
链接失效反馈
官方服务:
资源简介:
What’s new in this version: Some annotations were missing one of the visual attributes (orientation or writing implement). All missing attributes have been added in this version.<br> <br> This dataset is structured into four components, each serving a distinct role in the development of a document analysis system.  <strong>Word-level annotations</strong> are provided in the file <code>word_annotations_for_cropped_images.json</code>. These annotations describe the images contained in the <code>cropped_images</code> folder. Each entry specifies the location of a word as a polygon, together with its orientation (horizontal, vertical, or tilted) and the type of writing implement used (ink or pencil). Additional metadata, such as bounding boxes and segmentation areas, is also included. <strong>Cropped images</strong> are stored in the <code>cropped_images</code> folder. This set comprises 50 images, each containing only the primary page extracted from the corresponding full notebook scans. <strong>Full images</strong> are located in the <code>full_images</code> folder. This collection also contains 50 items, representing the complete notebook scans in which the primary page appears alongside other material. <strong>Page-level annotations</strong> are contained in the <code>page_annotations</code> folder. These are provided in YOLO format, with a single class (<code>page</code>) defined in <code>classes.txt</code>. Each annotation file specifies the bounding box of the primary page within the corresponding image in the <code>full_images</code> folder. Examples illustrate the annotation structure. In the JSON file, a typical word annotation records polygon coordinates, the attribute <code>"orientation": "horizontal"</code>, and <code>"writing_tool": "pencil"</code>. In the YOLO annotations, a sample entry such as <code>0 0.499023 0.500776 0.777344 0.816912</code> denotes the normalised coordinates of the primary page bounding box. <strong>Acknowledgement:</strong> The research for this work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy - EXC 2176 ‘Understanding Written Artefacts: Material, Interaction and Transmission in Manuscript Cultures’, project no. 390893796. The research was conducted within the scope of the Centre for the Study of Manuscript Cultures (CSMC) at Universität Hamburg.<br> <br> The images are taken from notebook pages of Rainer Maria Rilke, from the Deutsche Literaturarchiv Marbach (DLA), A:Rilke-Archiv Gernsbach.  We thank Hui Xu for her support in annotating the images.
提供机构:
Universität Hamburg
创建时间:
2025-11-07
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作