gokaygokay/random_instruct_docci
收藏Hugging Face2024-05-10 更新2024-05-25 收录
下载链接:
https://hf-mirror.com/datasets/gokaygokay/random_instruct_docci
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: image
dtype: image
- name: example_id
dtype: string
- name: description
dtype: string
- name: qa
list:
- name: answer
dtype: string
- name: question
dtype: string
splits:
- name: train
num_bytes: 7060345226.106
num_examples: 13647
- name: test
num_bytes: 254553007
num_examples: 500
- name: val
num_bytes: 261171671
num_examples: 500
download_size: 7544082762
dataset_size: 7576069904.106
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: test
path: data/test-*
- split: val
path: data/val-*
license: apache-2.0
---
The dataset consists of a collection of general questions and orders/instructions added to [google/docci](https://huggingface.co/datasets/google/docci) dataset. 4000 test example moved into train dataset.
提供机构:
gokaygokay
原始信息汇总
数据集概述
数据集特征
- image: 图像数据类型
- example_id: 字符串数据类型
- description: 字符串数据类型
- qa: 包含以下子特征
- answer: 字符串数据类型
- question: 字符串数据类型
数据集分割
- train: 包含13647个样本,占用7060345226.106字节
- test: 包含500个样本,占用254553007字节
- val: 包含500个样本,占用261171671字节
数据集大小
- 下载大小: 7544082762字节
- 数据集大小: 7576069904.106字节
数据文件配置
- config_name: default
- data_files:
- train: 路径为
data/train-* - test: 路径为
data/test-* - val: 路径为
data/val-*
- train: 路径为
许可证
- license: apache-2.0



