five

TeeA/ChartQA

收藏
Hugging Face2024-05-29 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/TeeA/ChartQA
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: default features: - name: id_image dtype: string - name: image dtype: image - name: table dtype: string - name: chart_type dtype: string - name: qa list: - name: label dtype: string - name: query dtype: string - name: vi_qa list: - name: label dtype: string - name: query dtype: string - name: vi_table dtype: string splits: - name: train num_bytes: 863105949.214 num_examples: 18317 - name: validation num_bytes: 50520962.392 num_examples: 1056 - name: test num_bytes: 70162622.487 num_examples: 1509 download_size: 963469804 dataset_size: 983789534.0929999 - config_name: dictionary features: - name: id_image dtype: string - name: id_table dtype: string - name: cell_value dtype: string - name: cell_type dtype: string splits: - name: train num_bytes: 30090420 num_examples: 523696 - name: validation num_bytes: 1655359 num_examples: 27796 - name: test num_bytes: 2299282 num_examples: 39417 download_size: 5615841 dataset_size: 34045061 - config_name: gemini-translate features: - name: id_image dtype: string - name: image dtype: image - name: table dtype: string - name: chart_type dtype: string - name: qa list: - name: label dtype: string - name: query dtype: string - name: vi_qa list: - name: label dtype: string - name: query dtype: string - name: gemini_response dtype: string splits: - name: train num_bytes: 860931675.061 num_examples: 18317 - name: validation num_bytes: 48369553.096 num_examples: 1056 - name: test num_bytes: 67244461.03 num_examples: 1509 download_size: 954876861 dataset_size: 976545689.1869999 - config_name: vietnam-3000-gemini features: - name: id_image dtype: string - name: image dtype: image - name: table dtype: string - name: chart_type dtype: string - name: qa list: - name: label dtype: string - name: query dtype: string - name: vi_qa list: - name: label dtype: string - name: query dtype: string - name: gemini_response dtype: string - name: vi_image dtype: image - name: vi_table dtype: string - name: vi_title dtype: string - name: vi_x_label dtype: string - name: vi_y_label dtype: string splits: - name: train num_bytes: 258211841.452 num_examples: 3332 download_size: 252747999 dataset_size: 258211841.452 - config_name: vietnamese features: - name: id_image dtype: string - name: image dtype: image - name: table dtype: string - name: chart_type dtype: string - name: qa list: - name: label dtype: string - name: query dtype: string - name: vi_qa list: - name: label dtype: string - name: query dtype: string - name: title dtype: string - name: x_label dtype: string - name: y_label dtype: string - name: Python dtype: string splits: - name: train num_bytes: 855859414.646 num_examples: 18317 - name: validation num_bytes: 47988247.376 num_examples: 1056 - name: test num_bytes: 66717750.575 num_examples: 1509 download_size: 951094106 dataset_size: 970565412.5970001 - config_name: vin features: - name: id_image dtype: string - name: image dtype: image - name: table dtype: string - name: chart_type dtype: string - name: qa list: - name: label dtype: string - name: query dtype: string - name: vi_qa list: - name: label dtype: string - name: query dtype: string - name: vi_table dtype: string splits: - name: train num_bytes: 870471769.214 num_examples: 18317 - name: validation num_bytes: 50822738.392 num_examples: 1056 - name: test num_bytes: 70633671.487 num_examples: 1509 download_size: 967825853 dataset_size: 991928179.0929999 configs: - config_name: default data_files: - split: train path: data/train-* - split: validation path: data/validation-* - split: test path: data/test-* - config_name: dictionary data_files: - split: train path: dictionary/train-* - split: validation path: dictionary/validation-* - split: test path: dictionary/test-* - config_name: gemini-translate data_files: - split: train path: gemini-translate/train-* - split: validation path: gemini-translate/validation-* - split: test path: gemini-translate/test-* - config_name: vietnam-3000-gemini data_files: - split: train path: vietnam-3000-gemini/train-* - config_name: vietnamese data_files: - split: train path: vietnamese/train-* - split: validation path: vietnamese/validation-* - split: test path: vietnamese/test-* - config_name: vin data_files: - split: train path: vin/train-* - split: validation path: vin/validation-* - split: test path: vin/test-* --- Dataset is converted from https://github.com/vis-nlp/ChartQA vin là tập đã dịch các qa 3000 là tập các chart đã dịch
提供机构:
TeeA
原始信息汇总

数据集概述

数据集配置

默认配置 (default)

  • 特征:
    • id_image: 字符串
    • image: 图像
    • table: 字符串
    • chart_type: 字符串
    • qa: 列表
      • label: 字符串
      • query: 字符串
    • vi_qa: 列表
      • label: 字符串
      • query: 字符串
    • vi_table: 字符串
  • 分割:
    • train: 863105949.214 字节, 18317 个样本
    • validation: 50520962.392 字节, 1056 个样本
    • test: 70162622.487 字节, 1509 个样本
  • 下载大小: 963469804 字节
  • 数据集大小: 983789534.0929999 字节

字典配置 (dictionary)

  • 特征:
    • id_image: 字符串
    • id_table: 字符串
    • cell_value: 字符串
    • cell_type: 字符串
  • 分割:
    • train: 30090420 字节, 523696 个样本
    • validation: 1655359 字节, 27796 个样本
    • test: 2299282 字节, 39417 个样本
  • 下载大小: 5615841 字节
  • 数据集大小: 34045061 字节

Gemini 翻译配置 (gemini-translate)

  • 特征:
    • id_image: 字符串
    • image: 图像
    • table: 字符串
    • chart_type: 字符串
    • qa: 列表
      • label: 字符串
      • query: 字符串
    • vi_qa: 列表
      • label: 字符串
      • query: 字符串
    • gemini_response: 字符串
  • 分割:
    • train: 860931675.061 字节, 18317 个样本
    • validation: 48369553.096 字节, 1056 个样本
    • test: 67244461.03 字节, 1509 个样本
  • 下载大小: 954876861 字节
  • 数据集大小: 976545689.1869999 字节

Vietnam 3000 Gemini 配置 (vietnam-3000-gemini)

  • 特征:
    • id_image: 字符串
    • image: 图像
    • table: 字符串
    • chart_type: 字符串
    • qa: 列表
      • label: 字符串
      • query: 字符串
    • vi_qa: 列表
      • label: 字符串
      • query: 字符串
    • gemini_response: 字符串
    • vi_image: 图像
    • vi_table: 字符串
    • vi_title: 字符串
    • vi_x_label: 字符串
    • vi_y_label: 字符串
  • 分割:
    • train: 258211841.452 字节, 3332 个样本
  • 下载大小: 252747999 字节
  • 数据集大小: 258211841.452 字节

越南语配置 (vietnamese)

  • 特征:
    • id_image: 字符串
    • image: 图像
    • table: 字符串
    • chart_type: 字符串
    • qa: 列表
      • label: 字符串
      • query: 字符串
    • vi_qa: 列表
      • label: 字符串
      • query: 字符串
    • title: 字符串
    • x_label: 字符串
    • y_label: 字符串
    • Python: 字符串
  • 分割:
    • train: 855859414.646 字节, 18317 个样本
    • validation: 47988247.376 字节, 1056 个样本
    • test: 66717750.575 字节, 1509 个样本
  • 下载大小: 951094106 字节
  • 数据集大小: 970565412.5970001 字节

VIN 配置 (vin)

  • 特征:
    • id_image: 字符串
    • image: 图像
    • table: 字符串
    • chart_type: 字符串
    • qa: 列表
      • label: 字符串
      • query: 字符串
    • vi_qa: 列表
      • label: 字符串
      • query: 字符串
    • vi_table: 字符串
  • 分割:
    • train: 870471769.214 字节, 18317 个样本
    • validation: 50822738.392 字节, 1056 个样本
    • test: 70633671.487 字节, 1509 个样本
  • 下载大小: 967825853 字节
  • 数据集大小: 991928179.0929999 字节
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作