sitloboi2012/rvl_cdip_large_dataset
收藏Hugging Face2023-10-01 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/sitloboi2012/rvl_cdip_large_dataset
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: test
path: data/test-*
- split: validate
path: data/validate-*
dataset_info:
features:
- name: image
dtype: image
- name: label
dtype:
class_label:
names:
'0': letter
'1': form
'2': email
'3': handwritten
'4': advertisement
'5': scientific report
'6': scientific publication
'7': specification
'8': file folder
'9': news article
'10': budget
'11': invoice
'12': presentation
'13': questionnaire
'14': resume
'15': memo
splits:
- name: train
num_bytes: 3694582118.36
num_examples: 30400
- name: test
num_bytes: 388902596.88
num_examples: 3200
- name: validate
num_bytes: 388902596.88
num_examples: 3200
download_size: 4204560106
dataset_size: 4472387312.12
---
# Dataset Card for "rvl_cdip_large_dataset"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
sitloboi2012
原始信息汇总
数据集概述
配置
- 默认配置:
- 数据文件:
- 训练集(train):
data/train-* - 测试集(test):
data/test-* - 验证集(validate):
data/validate-*
- 训练集(train):
- 数据文件:
数据集信息
-
特征:
- 图像(image):数据类型为图像
- 标签(label):数据类型为类别标签,包含以下类别:
- 0: letter
- 1: form
- 2: email
- 3: handwritten
- 4: advertisement
- 5: scientific report
- 6: scientific publication
- 7: specification
- 8: file folder
- 9: news article
- 10: budget
- 11: invoice
- 12: presentation
- 13: questionnaire
- 14: resume
- 15: memo
-
分割:
- 训练集(train):
- 字节数:3694582118.36
- 样本数:30400
- 测试集(test):
- 字节数:388902596.88
- 样本数:3200
- 验证集(validate):
- 字节数:388902596.88
- 样本数:3200
- 训练集(train):
-
下载大小:4204560106字节
-
数据集大小:4472387312.12字节



