D4LA-版面分析数据集

Name: D4LA-版面分析数据集
Creator: maas
Published: 2026-05-23 17:56:37
License: 暂无描述

魔搭社区2026-05-23 更新2024-05-15 收录

下载链接：

https://modelscope.cn/datasets/iic/D4LA

下载链接

链接失效反馈

官方服务：

资源简介：

# D4LA 文档版面分析数据集 (the most Diverse and Detailed Dataset ever for Document Layout Analysis) ## 数据集概述 D4LA是面向文档版面分析的数据集。 ### 数据集简介包含12类文档工27类文档版面类型，详细如下: ![d4la](./d4la.jpg) ### 标注格式 ``` D4LA ├── train_images │ ├── 1.jpg ├── test_images │ ├── 2.jpg ├── VGT_D4LA_grid_pkl │ ├── 1.pkl │ └── 2.pkl ├── json │ ├── train.json │ └── test.json ``` ### 引用方式 If you find this repository useful, please consider citing our work: ``` @inproceedings{da2023vgt, title={Vision Grid Transformer for Document Layout Analysis}, author={Cheng Da and Chuwei Luo and Qi Zheng and Cong Yao}, year={2023}, booktitle = {ICCV}, } ``` ### Clone with HTTP ```bash git clone https://www.modelscope.cn/datasets/damo/D4LA.git ```

# D4LA 文档版面分析（Document Layout Analysis）数据集（目前最具多样性与完备性的文档版面分析数据集） ## 数据集概述 D4LA是专为文档版面分析任务打造的数据集。 ### 数据集简介本数据集涵盖12类文档类别与27类文档版面类型，详情如下： ![d4la](./d4la.jpg) ### 标注格式 D4LA ├── train_images │ ├── 1.jpg ├── test_images │ ├── 2.jpg ├── VGT_D4LA_grid_pkl │ ├── 1.pkl │ └── 2.pkl ├── json │ ├── train.json │ └── test.json ### 引用方式若您认为本仓库对研究有所助益，请引用如下论文： @inproceedings{da2023vgt, title={Vision Grid Transformer for Document Layout Analysis}, author={Cheng Da and Chuwei Luo and Qi Zheng and Cong Yao}, year={2023}, booktitle = {ICCV}, } ### HTTP 克隆方式 bash git clone https://www.modelscope.cn/datasets/damo/D4LA.git

提供机构：

maas

创建时间：

2023-08-16

搜集汇总

数据集介绍

背景与挑战

背景概述

D4LA是一个专门用于文档版面分析的数据集，以其多样性和详细性为特点，包含12类文档和27类版面类型。该数据集支持训练和测试，标注格式包括图像和JSON文件，基于2023年ICCV会议的研究'Vision Grid Transformer for Document Layout Analysis'，适用于文档理解、文档AI和OCR等领域。

以上内容由遇见数据集搜集并总结生成