Multimodal-Fatima/OK-VQA_train
Source: Hugging Face. Updated 2023-03-23; indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/Multimodal-Fatima/OK-VQA_train
Resource description:
---
dataset_info:
  features:
  - name: image
    dtype: image
  - name: question_type
    dtype: string
  - name: confidence
    dtype: int32
  - name: answers
    sequence: string
  - name: answers_original
    list:
    - name: answer
      dtype: string
    - name: raw_answer
      dtype: string
    - name: answer_confidence
      dtype: string
    - name: answer_id
      dtype: int64
  - name: id_image
    dtype: int64
  - name: answer_type
    dtype: string
  - name: question_id
    dtype: int64
  - name: question
    dtype: string
  - name: id
    dtype: int64
  - name: clip_tags_ViT_L_14
    sequence: string
  - name: clip_tags_LAION_ViT_H_14_2B
    sequence: string
  - name: blip_caption_beam_5
    dtype: string
  - name: LLM_Description_gpt3_downstream_tasks_visual_genome_ViT_L_14
    sequence: string
  - name: LLM_Description_gpt3_downstream_tasks_visual_genome_LAION-ViT-H-14-2B
    sequence: string
  - name: DETA_detections_deta_swin_large_o365_coco_classes
    list:
    - name: attribute
      dtype: string
    - name: box
      sequence: float32
    - name: label
      dtype: string
    - name: location
      dtype: string
    - name: ratio
      dtype: float32
    - name: size
      dtype: string
    - name: tag
      dtype: string
  - name: DETA_detections_deta_swin_large_o365_coco_classes_caption_module_random
    list:
    - name: attribute
      dtype: string
    - name: box
      sequence: float64
    - name: captions_module
      sequence: string
    - name: captions_module_filter
      sequence: string
    - name: label
      dtype: string
    - name: location
      dtype: string
    - name: ratio
      dtype: float64
    - name: size
      dtype: string
    - name: tag
      dtype: string
  splits:
  - name: train
    num_bytes: 1686555802.0
    num_examples: 9009
  download_size: 1572400067
  dataset_size: 1686555802.0
---
# Dataset Card for "OK-VQA_train"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
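
The card itself gives no usage instructions. As a minimal sketch, the train split can probably be loaded with the Hugging Face `datasets` library using the repository name shown on this page. The helper names `load_okvqa_train` and `summarize` are illustrative and not part of the dataset; streaming mode is an assumption chosen to avoid fetching the whole archive up front.

```python
from typing import Any, Dict

REPO_ID = "Multimodal-Fatima/OK-VQA_train"  # repository name from this page


def load_okvqa_train(streaming: bool = True):
    """Load the train split. Needs `pip install datasets` and network access."""
    # Imported lazily so that summarize() works without the library installed.
    from datasets import load_dataset

    # Assumption: streaming=True iterates records without a full download.
    return load_dataset(REPO_ID, split="train", streaming=streaming)


def summarize(example: Dict[str, Any]) -> str:
    """Build a one-line summary from a record's question and answer fields."""
    answers = ", ".join(sorted(set(example["answers"])))
    return f"[{example['question_id']}] {example['question']} -> {answers}"


# Usage (requires network access):
#   first = next(iter(load_okvqa_train()))
#   print(summarize(first))
```

`summarize` only touches the `question_id`, `question`, and `answers` fields, so it can be tried on any dictionary with those keys before downloading anything.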
Provider:
Multimodal-Fatima
Summary of original information
Dataset overview
Dataset features
- image: image data
- question_type: question type (string)
- confidence: confidence (int32)
- answers: sequence of answers (string)
- answers_original: list of original answers
  - answer: answer (string)
  - raw_answer: raw answer (string)
  - answer_confidence: answer confidence (string)
  - answer_id: answer ID (int64)
- id_image: image ID (int64)
- answer_type: answer type (string)
- question_id: question ID (int64)
- question: question text (string)
- id: ID (int64)
- clip_tags_ViT_L_14: sequence of tags (string)
- clip_tags_LAION_ViT_H_14_2B: sequence of tags (string)
- blip_caption_beam_5: caption (string)
- LLM_Description_gpt3_downstream_tasks_visual_genome_ViT_L_14: sequence of descriptions (string)
- LLM_Description_gpt3_downstream_tasks_visual_genome_LAION-ViT-H-14-2B: sequence of descriptions (string)
- DETA_detections_deta_swin_large_o365_coco_classes: list of detections
  - attribute: attribute (string)
  - box: bounding box, sequence of float32
  - label: label (string)
  - location: location (string)
  - ratio: ratio (float32)
  - size: size (string)
  - tag: tag (string)
- DETA_detections_deta_swin_large_o365_coco_classes_caption_module_random: list of detections
  - attribute: attribute (string)
  - box: bounding box, sequence of float64
  - captions_module: sequence of module captions (string)
  - captions_module_filter: sequence of filtered module captions (string)
  - label: label (string)
  - location: location (string)
  - ratio: ratio (float64)
  - size: size (string)
  - tag: tag (string)
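
To make the feature list concrete, the sketch below checks a record against the top-level field names and pulls the labels out of a detection list. It assumes each record is a plain dictionary and each detection list is a list of dictionaries, which is how such features are usually materialized; `missing_fields` and `detection_labels` are hypothetical helper names, not part of the dataset.

```python
from typing import Any, Dict, List

DETECTIONS_KEY = "DETA_detections_deta_swin_large_o365_coco_classes"

# Top-level field names copied from the feature list.
EXPECTED_FIELDS = [
    "image", "question_type", "confidence", "answers", "answers_original",
    "id_image", "answer_type", "question_id", "question", "id",
    "clip_tags_ViT_L_14", "clip_tags_LAION_ViT_H_14_2B", "blip_caption_beam_5",
    "LLM_Description_gpt3_downstream_tasks_visual_genome_ViT_L_14",
    "LLM_Description_gpt3_downstream_tasks_visual_genome_LAION-ViT-H-14-2B",
    DETECTIONS_KEY,
    DETECTIONS_KEY + "_caption_module_random",
]


def missing_fields(record: Dict[str, Any]) -> List[str]:
    """Return the expected top-level fields absent from a record."""
    return [name for name in EXPECTED_FIELDS if name not in record]


def detection_labels(record: Dict[str, Any], key: str = DETECTIONS_KEY) -> List[str]:
    """Collect the `label` of every detection stored under `key`."""
    # Assumption: the field holds a list of dicts; a missing or None value
    # is treated as an empty list.
    return [det["label"] for det in (record.get(key) or [])]
```

Passing `DETECTIONS_KEY + "_caption_module_random"` as `key` reads the second detection list, which shares the `label` field but stores `box` and `ratio` as float64.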
Dataset splits
- train: training set
  - num_bytes: 1686555802.0 bytes
  - num_examples: 9009 examples
Dataset size
- download_size: 1572400067 bytes
- dataset_size: 1686555802.0 bytes
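
The size figures are easier to judge in larger units. The short calculation below converts them to GiB and derives the average record size (about 187 kB per example) and the download-to-dataset size ratio. The constants come from the card; the variable and function names are illustrative.

```python
NUM_BYTES = 1_686_555_802       # num_bytes / dataset_size from the card
DOWNLOAD_BYTES = 1_572_400_067  # download_size from the card
NUM_EXAMPLES = 9_009            # num_examples in the train split


def to_gib(n_bytes: float) -> float:
    """Convert a byte count to gibibytes (1 GiB = 2**30 bytes)."""
    return n_bytes / 2**30


# Average stored size of one example, in bytes.
avg_bytes_per_example = NUM_BYTES / NUM_EXAMPLES

# Fraction of the prepared dataset size that must be downloaded.
download_ratio = DOWNLOAD_BYTES / NUM_BYTES

print(f"dataset size:  {to_gib(NUM_BYTES):.2f} GiB")
print(f"download size: {to_gib(DOWNLOAD_BYTES):.2f} GiB")
print(f"avg per example: {avg_bytes_per_example / 1000:.1f} kB")
print(f"download / dataset ratio: {download_ratio:.3f}")
```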