jinhybr/rvl_cdip_400_train_val_test
收藏Hugging Face2022-11-11 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/jinhybr/rvl_cdip_400_train_val_test
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: image
dtype: image
- name: label
dtype:
class_label:
names:
0: letter
1: form
2: email
3: handwritten
4: advertisement
5: scientific report
6: scientific publication
7: specification
8: file folder
9: news article
10: budget
11: invoice
12: presentation
13: questionnaire
14: resume
15: memo
- name: ground_truth
dtype: string
splits:
- name: test
num_bytes: 197669272.0
num_examples: 1600
- name: train
num_bytes: 781258280.0
num_examples: 6400
- name: validation
num_bytes: 191125740.0
num_examples: 1600
download_size: 1101475597
dataset_size: 1170053292.0
---
# Dataset Card for "rvl_cdip_400_train_val_test"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
### Dataset Format
````
DatasetDict({
test: Dataset({
features: ['image', 'label', 'ground_truth'],
num_rows: 1600
})
train: Dataset({
features: ['image', 'label', 'ground_truth'],
num_rows: 6400
})
validation: Dataset({
features: ['image', 'label', 'ground_truth'],
num_rows: 1600
})
})
````
提供机构:
jinhybr
原始信息汇总
数据集概述
数据集名称
- 名称: rvl_cdip_400_train_val_test
数据集特征
- 特征列表:
- image: 图像数据
- label: 类别标签,包含以下类别:
- 0: letter
- 1: form
- 2: email
- 3: handwritten
- 4: advertisement
- 5: scientific report
- 6: scientific publication
- 7: specification
- 8: file folder
- 9: news article
- 10: budget
- 11: invoice
- 12: presentation
- 13: questionnaire
- 14: resume
- 15: memo
- ground_truth: 字符串数据
数据集拆分
- 拆分详情:
- test:
- num_examples: 1600
- num_bytes: 197669272.0
- train:
- num_examples: 6400
- num_bytes: 781258280.0
- validation:
- num_examples: 1600
- num_bytes: 191125740.0
- test:
数据集大小
- 下载大小: 1101475597
- 数据集总大小: 1170053292.0



