Ryoo72/DocStruct4M
收藏Hugging Face2025-02-07 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/Ryoo72/DocStruct4M
下载链接
链接失效反馈官方服务:
资源简介:
一个包含文本和图片的数据集,由multi_grained_text_localization和struct_aware_parse两个数据集合并而成,经过筛选和处理后上传到HuggingFace Hub。数据集分为训练集和验证集,适用于文档结构分析和文本定位等任务。
A dataset containing text and images, merged from the multi_grained_text_localization and struct_aware_parse datasets, filtered and processed, then uploaded to HuggingFace Hub. The dataset is divided into training and validation sets, suitable for tasks such as document structure analysis and text localization.
提供机构:
Ryoo72



