TeeA/ChartQA
收藏Hugging Face2024-05-29 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/TeeA/ChartQA
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: default
features:
- name: id_image
dtype: string
- name: image
dtype: image
- name: table
dtype: string
- name: chart_type
dtype: string
- name: qa
list:
- name: label
dtype: string
- name: query
dtype: string
- name: vi_qa
list:
- name: label
dtype: string
- name: query
dtype: string
- name: vi_table
dtype: string
splits:
- name: train
num_bytes: 863105949.214
num_examples: 18317
- name: validation
num_bytes: 50520962.392
num_examples: 1056
- name: test
num_bytes: 70162622.487
num_examples: 1509
download_size: 963469804
dataset_size: 983789534.0929999
- config_name: dictionary
features:
- name: id_image
dtype: string
- name: id_table
dtype: string
- name: cell_value
dtype: string
- name: cell_type
dtype: string
splits:
- name: train
num_bytes: 30090420
num_examples: 523696
- name: validation
num_bytes: 1655359
num_examples: 27796
- name: test
num_bytes: 2299282
num_examples: 39417
download_size: 5615841
dataset_size: 34045061
- config_name: gemini-translate
features:
- name: id_image
dtype: string
- name: image
dtype: image
- name: table
dtype: string
- name: chart_type
dtype: string
- name: qa
list:
- name: label
dtype: string
- name: query
dtype: string
- name: vi_qa
list:
- name: label
dtype: string
- name: query
dtype: string
- name: gemini_response
dtype: string
splits:
- name: train
num_bytes: 860931675.061
num_examples: 18317
- name: validation
num_bytes: 48369553.096
num_examples: 1056
- name: test
num_bytes: 67244461.03
num_examples: 1509
download_size: 954876861
dataset_size: 976545689.1869999
- config_name: vietnam-3000-gemini
features:
- name: id_image
dtype: string
- name: image
dtype: image
- name: table
dtype: string
- name: chart_type
dtype: string
- name: qa
list:
- name: label
dtype: string
- name: query
dtype: string
- name: vi_qa
list:
- name: label
dtype: string
- name: query
dtype: string
- name: gemini_response
dtype: string
- name: vi_image
dtype: image
- name: vi_table
dtype: string
- name: vi_title
dtype: string
- name: vi_x_label
dtype: string
- name: vi_y_label
dtype: string
splits:
- name: train
num_bytes: 258211841.452
num_examples: 3332
download_size: 252747999
dataset_size: 258211841.452
- config_name: vietnamese
features:
- name: id_image
dtype: string
- name: image
dtype: image
- name: table
dtype: string
- name: chart_type
dtype: string
- name: qa
list:
- name: label
dtype: string
- name: query
dtype: string
- name: vi_qa
list:
- name: label
dtype: string
- name: query
dtype: string
- name: title
dtype: string
- name: x_label
dtype: string
- name: y_label
dtype: string
- name: Python
dtype: string
splits:
- name: train
num_bytes: 855859414.646
num_examples: 18317
- name: validation
num_bytes: 47988247.376
num_examples: 1056
- name: test
num_bytes: 66717750.575
num_examples: 1509
download_size: 951094106
dataset_size: 970565412.5970001
- config_name: vin
features:
- name: id_image
dtype: string
- name: image
dtype: image
- name: table
dtype: string
- name: chart_type
dtype: string
- name: qa
list:
- name: label
dtype: string
- name: query
dtype: string
- name: vi_qa
list:
- name: label
dtype: string
- name: query
dtype: string
- name: vi_table
dtype: string
splits:
- name: train
num_bytes: 870471769.214
num_examples: 18317
- name: validation
num_bytes: 50822738.392
num_examples: 1056
- name: test
num_bytes: 70633671.487
num_examples: 1509
download_size: 967825853
dataset_size: 991928179.0929999
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: validation
path: data/validation-*
- split: test
path: data/test-*
- config_name: dictionary
data_files:
- split: train
path: dictionary/train-*
- split: validation
path: dictionary/validation-*
- split: test
path: dictionary/test-*
- config_name: gemini-translate
data_files:
- split: train
path: gemini-translate/train-*
- split: validation
path: gemini-translate/validation-*
- split: test
path: gemini-translate/test-*
- config_name: vietnam-3000-gemini
data_files:
- split: train
path: vietnam-3000-gemini/train-*
- config_name: vietnamese
data_files:
- split: train
path: vietnamese/train-*
- split: validation
path: vietnamese/validation-*
- split: test
path: vietnamese/test-*
- config_name: vin
data_files:
- split: train
path: vin/train-*
- split: validation
path: vin/validation-*
- split: test
path: vin/test-*
---
Dataset is converted from https://github.com/vis-nlp/ChartQA
vin là tập đã dịch các qa
3000 là tập các chart đã dịch
提供机构:
TeeA
原始信息汇总
数据集概述
数据集配置
默认配置 (default)
- 特征:
id_image: 字符串image: 图像table: 字符串chart_type: 字符串qa: 列表label: 字符串query: 字符串
vi_qa: 列表label: 字符串query: 字符串
vi_table: 字符串
- 分割:
train: 863105949.214 字节, 18317 个样本validation: 50520962.392 字节, 1056 个样本test: 70162622.487 字节, 1509 个样本
- 下载大小: 963469804 字节
- 数据集大小: 983789534.0929999 字节
字典配置 (dictionary)
- 特征:
id_image: 字符串id_table: 字符串cell_value: 字符串cell_type: 字符串
- 分割:
train: 30090420 字节, 523696 个样本validation: 1655359 字节, 27796 个样本test: 2299282 字节, 39417 个样本
- 下载大小: 5615841 字节
- 数据集大小: 34045061 字节
Gemini 翻译配置 (gemini-translate)
- 特征:
id_image: 字符串image: 图像table: 字符串chart_type: 字符串qa: 列表label: 字符串query: 字符串
vi_qa: 列表label: 字符串query: 字符串
gemini_response: 字符串
- 分割:
train: 860931675.061 字节, 18317 个样本validation: 48369553.096 字节, 1056 个样本test: 67244461.03 字节, 1509 个样本
- 下载大小: 954876861 字节
- 数据集大小: 976545689.1869999 字节
Vietnam 3000 Gemini 配置 (vietnam-3000-gemini)
- 特征:
id_image: 字符串image: 图像table: 字符串chart_type: 字符串qa: 列表label: 字符串query: 字符串
vi_qa: 列表label: 字符串query: 字符串
gemini_response: 字符串vi_image: 图像vi_table: 字符串vi_title: 字符串vi_x_label: 字符串vi_y_label: 字符串
- 分割:
train: 258211841.452 字节, 3332 个样本
- 下载大小: 252747999 字节
- 数据集大小: 258211841.452 字节
越南语配置 (vietnamese)
- 特征:
id_image: 字符串image: 图像table: 字符串chart_type: 字符串qa: 列表label: 字符串query: 字符串
vi_qa: 列表label: 字符串query: 字符串
title: 字符串x_label: 字符串y_label: 字符串Python: 字符串
- 分割:
train: 855859414.646 字节, 18317 个样本validation: 47988247.376 字节, 1056 个样本test: 66717750.575 字节, 1509 个样本
- 下载大小: 951094106 字节
- 数据集大小: 970565412.5970001 字节
VIN 配置 (vin)
- 特征:
id_image: 字符串image: 图像table: 字符串chart_type: 字符串qa: 列表label: 字符串query: 字符串
vi_qa: 列表label: 字符串query: 字符串
vi_table: 字符串
- 分割:
train: 870471769.214 字节, 18317 个样本validation: 50822738.392 字节, 1056 个样本test: 70633671.487 字节, 1509 个样本
- 下载大小: 967825853 字节
- 数据集大小: 991928179.0929999 字节



