dh-unibe/transkribus-exports-30840-raw-xml
收藏Hugging Face2026-01-22 更新2026-02-07 收录
下载链接:
https://hf-mirror.com/datasets/dh-unibe/transkribus-exports-30840-raw-xml
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为transkribus-exports-30840-raw-xml,是通过Transkribus PageXML数据转换工具创建的。它包含137487个样本,全部位于训练集(train)中。数据集的主要特性包括图像数据(image)、XML内容(xml_content)、文件名(filename)和项目名称(project_name)。数据以parquet格式存储,并按split和project_name进行组织。数据集的使用示例展示了如何加载整个数据集或特定split。
This dataset named transkribus-exports-30840-raw-xml was created using a converter from Transkribus PageXML data. It contains 137487 samples across 1 split (train). The main features of the dataset include image data (image), XML content (xml_content), filename (filename), and project name (project_name). The data is stored in parquet format and organized by split and project_name. Usage examples are provided to demonstrate how to load the entire dataset or specific splits.
提供机构:
dh-unibe



