Epicguest97/doclaynet10classes
收藏Hugging Face2025-03-29 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Epicguest97/doclaynet10classes
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含图像及其相关信息的复杂数据集,其中包括图像的边界框、类别ID、分割信息、区域面积以及PDF文档的相关单元格信息,如字体、文本和单元格的边界框。数据集还包含一些元数据,如图像的尺寸、文档类别、图像ID、页数、原始文件名、原始尺寸、页面哈希和页码。数据集分为训练集、测试集和验证集,用于不同的机器学习任务。
This is a complex dataset containing images and their related information, including bounding boxes, category IDs, segmentation information, area sizes, and related cell information of PDF documents such as font, text, and cell bounding boxes. The dataset also contains metadata such as image dimensions, document category, image ID, number of pages, original filename, original dimensions, page hash, and page number. The dataset is split into training, test, and validation sets for different machine learning tasks.
提供机构:
Epicguest97



