Worms

Name: Worms
Creator: maas
Published: 2026-01-07 22:31:32
License: 暂无描述

魔搭社区2026-01-07 更新2024-08-31 收录

下载链接：

https://modelscope.cn/datasets/yuekun/Worms

下载链接

链接失效反馈

官方服务：

资源简介：

# Text Recognition Dataset for Full Images and Cropped Images This repository contains the dataset used for text recognition tasks, including both full images and cropped images. ## Dataset Structure The dataset is organized into two main parts: full_image and crop_image. ### Full Image Each full_image dataset includes the following directories: lmdb: Stores image information. gt_word: Stores word-level annotation information, with each image corresponding to a .txt file. gt_line: Stores line-level annotation information, with each image corresponding to a .txt file. word_line: Stores word and line one-to-one correspondence information. Example structure: ``` train_data ├── full_image │ ├── IC15_train │ │ ├── lmdb │ │ ├── gt_word │ │ ├── gt_line │ │ └── word_line │ ├── CTW1500_train │ │ └── ... │ ├── TotalText_train │ │ └── ... │ ├── HierText_train │ │ └── ... │ ├── HierText_val │ │ └── ... │ ├── IC13_train │ │ └── ... │ ├── OpenImagesV5Text_train │ │ └── ... │ ├── OpenImagesV5Text_val │ │ └── ... │ ├── Uber_train │ │ └── ... │ ├── Uber_val │ │ └── ... │ ├── COCOText_train │ │ └── ... │ ├── COCOText_val │ │ └── ... │ ├── syntext1 │ │ └── ... │ ├── syntext2 │ │ └── ... │ ├── mlt2017 │ │ └── ... │ ├── publaynet_train │ │ └── ... │ ├── publaynet_val │ │ └── ... │ └── laion-ocr │ └── ... test_data ├── full_image │ ├── IC13_test │ │ ├── images │ │ └── word_line │ ├── IC15_test │ │ └── ... │ ├── TotalText_test │ │ └── ... │ └── CTW1500_test-ocr │ └── ... ``` Some datasets may only contain lmdb and gt_word directories. ### Cropped Image Each crop_image dataset includes the following directory: lmdb: Stores image information, where all images and labels are stored. Example structure: ``` train_data ├── crop_image │ ├── MJSynth │ │ ├── data.mdb │ │ └── lock.mdb │ ├── SynthText │ │ └── ... │ ├── SynthText_Add │ │ └── ... │ ├── union14m-new │ │ └── ... │ ├── oov-train │ │ └── ... │ ├── laion-ocr-crop │ │ └── ... │ ├── iam_line_train_lmdb │ │ └── ... │ ├── iam_train │ │ └── ... │ ├── iam_val1 │ │ └── ... │ ├── iam_val2 │ │ └── ... │ ├── cvl_train │ │ └── ... │ ├── rimes_word_train_lmdb │ │ └── ... │ ├── CROHME │ │ └── ... │ ├── HME100K │ │ └── ... │ ├── latex-ocr-train │ │ └── ... │ ├── latex-ocr-val │ │ └── ... │ ├── Synth-arxiv1 │ │ └── ... │ ├── Synth-arxiv2 │ │ └── ... │ ├── Synth-arxiv3 │ │ └── ... │ ├── Synth-arxiv4 │ │ └── ... │ ├── Synth-arxiv5 │ │ └── ... │ ├── Synth-arxiv6 │ │ └── ... │ ├── Synth-arxiv7 │ │ └── ... │ ├── Synth-arxiv8 │ │ └── ... │ ├── Synth-arxiv9 │ │ └── ... │ ├── Synth-arxiv10 │ │ └── ... │ ├── Synth-arxiv11 │ │ └── ... test_data ├── crop_image │ ├── CUTE80 │ │ ├── data.mdb │ │ └── lock.mdb │ ├── IC13_857 │ │ └── ... │ ├── IC15_1811 │ │ └── ... │ ├── IIIT5k_3000 │ │ └── ... │ ├── SVT │ │ └── ... │ ├── SVTP │ │ └── ... │ ├── IC13_1015 │ │ └── ... │ ├── IC15_2077 │ │ └── ... │ ├── iam_test │ │ └── ... │ ├── cvl_test │ │ └── ... │ ├── rimes_word_test_lmdb │ │ └── ... │ ├── latex-ocr │ │ └── ... ``` ## How to Use Two scripts are provided to correctly read the dataset information: lmdb_full_img.py: Script to read full image data. lmdb_crop_img.py: Script to read cropped image data. To use the dataset, follow these steps: ```sh cd train cat full_image.tar.gz.part-* > full_image.tar.gz cat crop_image.tar.gz.part-* > crop_image.tar.gz tar -xzvf full_image.tar.gz tar -xzvf crop_image.tar.gz ``` This will reconstruct and unpack the dataset files for use. Download Method: :modelscope-code[]{type="git"}

# 适用于整图与裁剪图像的文本识别数据集本代码仓库包含用于文本识别任务的数据集，涵盖整幅图像与裁剪图像两类数据。 ## 数据集结构本数据集分为两大核心模块：整图数据集（full_image）与裁剪图数据集（crop_image）。 ### 整图数据集每组整图数据集包含以下目录： lmdb：存储图像相关信息的目录 gt_word：存储词级基准真值标注信息的目录，每张图像对应一个.txt标注文件 gt_line：存储行级基准真值标注信息的目录，每张图像对应一个.txt标注文件 word_line：存储词与行一一对应关系的目录示例目录结构如下： train_data ├── full_image │ ├── IC15_train │ │ ├── lmdb │ │ ├── gt_word │ │ ├── gt_line │ │ └── word_line │ ├── CTW1500_train │ │ └── ... │ ├── TotalText_train │ │ └── ... │ ├── HierText_train │ │ └── ... │ ├── HierText_val │ │ └── ... │ ├── IC13_train │ │ └── ... │ ├── OpenImagesV5Text_train │ │ └── ... │ ├── OpenImagesV5Text_val │ │ └── ... │ ├── Uber_train │ │ └── ... │ ├── Uber_val │ │ └── ... │ ├── COCOText_train │ │ └── ... │ ├── COCOText_val │ │ └── ... │ ├── syntext1 │ │ └── ... │ ├── syntext2 │ │ └── ... │ ├── mlt2017 │ │ └── ... │ ├── publaynet_train │ │ └── ... │ ├── publaynet_val │ │ └── ... │ └── laion-ocr │ └── ... test_data ├── full_image │ ├── IC13_test │ │ ├── images │ │ └── word_line │ ├── IC15_test │ │ └── ... │ ├── TotalText_test │ │ └── ... │ └── CTW1500_test-ocr │ └── ... 部分数据集可能仅包含lmdb与gt_word两个目录。 ### 裁剪图数据集每组裁剪图数据集包含以下目录： lmdb：存储图像与标注信息的目录，所有图像及标签均存储于此示例目录结构如下： train_data ├── crop_image │ ├── MJSynth │ │ ├── data.mdb │ │ └── lock.mdb │ ├── SynthText │ │ └── ... │ ├── SynthText_Add │ │ └── ... │ ├── union14m-new │ │ └── ... │ ├── oov-train │ │ └── ... │ ├── laion-ocr-crop │ │ └── ... │ ├── iam_line_train_lmdb │ │ └── ... │ ├── iam_train │ │ └── ... │ ├── iam_val1 │ │ └── ... │ ├── iam_val2 │ │ └── ... │ ├── cvl_train │ │ └── ... │ ├── rimes_word_train_lmdb │ │ └── ... │ ├── CROHME │ │ └── ... │ ├── HME100K │ │ └── ... │ ├── latex-ocr-train │ │ └── ... │ ├── latex-ocr-val │ │ └── ... │ ├── Synth-arxiv1 │ │ └── ... │ ├── Synth-arxiv2 │ │ └── ... │ ├── Synth-arxiv3 │ │ └── ... │ ├── Synth-arxiv4 │ │ └── ... │ ├── Synth-arxiv5 │ │ └── ... │ ├── Synth-arxiv6 │ │ └── ... │ ├── Synth-arxiv7 │ │ └── ... │ ├── Synth-arxiv8 │ │ └── ... │ ├── Synth-arxiv9 │ │ └── ... │ ├── Synth-arxiv10 │ │ └── ... │ ├── Synth-arxiv11 │ │ └── ... test_data ├── crop_image │ ├── CUTE80 │ │ ├── data.mdb │ │ └── lock.mdb │ ├── IC13_857 │ │ └── ... │ ├── IC15_1811 │ │ └── ... │ ├── IIIT5k_3000 │ │ └── ... │ ├── SVT │ │ └── ... │ ├── SVTP │ │ └── ... │ ├── IC13_1015 │ │ └── ... │ ├── IC15_2077 │ │ └── ... │ ├── iam_test │ │ └── ... │ ├── cvl_test │ │ └── ... │ ├── rimes_word_test_lmdb │ │ └── ... │ └── latex-ocr │ └── ... ## 使用方法本仓库提供两个脚本以正确读取数据集信息： lmdb_full_img.py：用于读取整图数据集的脚本 lmdb_crop_img.py：用于读取裁剪图数据集的脚本如需使用该数据集，请遵循以下步骤： sh cd train cat full_image.tar.gz.part-* > full_image.tar.gz cat crop_image.tar.gz.part-* > crop_image.tar.gz tar -xzvf full_image.tar.gz tar -xzvf crop_image.tar.gz 上述命令将重组并解压数据集文件，以供后续使用。下载方式： :modelscope-code[]{type="git"}

提供机构：

maas

创建时间：

2024-07-03

5,000+

优质数据集

54 个

任务类型

进入经典数据集