Multimodal-Fatima/OK-VQA_train

Name: Multimodal-Fatima/OK-VQA_train
Creator: Multimodal-Fatima
Published: 2023-03-23 22:30:06
License: 暂无描述

Hugging Face2023-03-23 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/Multimodal-Fatima/OK-VQA_train

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: image dtype: image - name: question_type dtype: string - name: confidence dtype: int32 - name: answers sequence: string - name: answers_original list: - name: answer dtype: string - name: raw_answer dtype: string - name: answer_confidence dtype: string - name: answer_id dtype: int64 - name: id_image dtype: int64 - name: answer_type dtype: string - name: question_id dtype: int64 - name: question dtype: string - name: id dtype: int64 - name: clip_tags_ViT_L_14 sequence: string - name: clip_tags_LAION_ViT_H_14_2B sequence: string - name: blip_caption_beam_5 dtype: string - name: LLM_Description_gpt3_downstream_tasks_visual_genome_ViT_L_14 sequence: string - name: LLM_Description_gpt3_downstream_tasks_visual_genome_LAION-ViT-H-14-2B sequence: string - name: DETA_detections_deta_swin_large_o365_coco_classes list: - name: attribute dtype: string - name: box sequence: float32 - name: label dtype: string - name: location dtype: string - name: ratio dtype: float32 - name: size dtype: string - name: tag dtype: string - name: DETA_detections_deta_swin_large_o365_coco_classes_caption_module_random list: - name: attribute dtype: string - name: box sequence: float64 - name: captions_module sequence: string - name: captions_module_filter sequence: string - name: label dtype: string - name: location dtype: string - name: ratio dtype: float64 - name: size dtype: string - name: tag dtype: string splits: - name: train num_bytes: 1686555802.0 num_examples: 9009 download_size: 1572400067 dataset_size: 1686555802.0 --- # Dataset Card for "OK-VQA_train" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)

提供机构：

Multimodal-Fatima

原始信息汇总

数据集概述

数据集特征

image：图像数据
question_type：问题类型，字符串类型
confidence：置信度，整数32位类型
answers：答案序列，字符串类型
answers_original：原始答案列表
- answer：答案，字符串类型
- raw_answer：原始答案，字符串类型
- answer_confidence：答案置信度，字符串类型
- answer_id：答案ID，整数64位类型
id_image：图像ID，整数64位类型
answer_type：答案类型，字符串类型
question_id：问题ID，整数64位类型
question：问题内容，字符串类型
id：ID，整数64位类型
clip_tags_ViT_L_14：标签序列，字符串类型
clip_tags_LAION_ViT_H_14_2B：标签序列，字符串类型
blip_caption_beam_5：标题，字符串类型
LLM_Description_gpt3_downstream_tasks_visual_genome_ViT_L_14：描述序列，字符串类型
LLM_Description_gpt3_downstream_tasks_visual_genome_LAION-ViT-H-14-2B：描述序列，字符串类型
DETA_detections_deta_swin_large_o365_coco_classes：检测列表
- attribute：属性，字符串类型
- box：边界框序列，浮点32位类型
- label：标签，字符串类型
- location：位置，字符串类型
- ratio：比例，浮点32位类型
- size：尺寸，字符串类型
- tag：标签，字符串类型
DETA_detections_deta_swin_large_o365_coco_classes_caption_module_random：检测列表
- attribute：属性，字符串类型
- box：边界框序列，浮点64位类型
- captions_module：标题模块序列，字符串类型
- captions_module_filter：标题模块过滤序列，字符串类型
- label：标签，字符串类型
- location：位置，字符串类型
- ratio：比例，浮点64位类型
- size：尺寸，字符串类型
- tag：标签，字符串类型

数据集分割

train：训练集
- num_bytes：数据大小为1686555802.0字节
- num_examples：包含9009个样本

数据集大小

download_size：下载大小为1572400067字节
dataset_size：数据集大小为1686555802.0字节

搜集汇总

数据集介绍

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集