xDAN-Vision/Cambrian10M_For_Mantis
收藏Hugging Face2024-07-12 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/xDAN-Vision/Cambrian10M_For_Mantis
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个配置,每个配置都包含id、图像(包含二进制数据和路径)、对话(包含角色和内容)以及来源信息。数据集主要用于训练集,涵盖了不同数量的字节和示例。这些数据集可能用于多模态任务,结合了图像和文本的对话数据。
The dataset contains multiple configurations, each of which includes id, images (containing binary data and paths), conversations (containing roles and content), and source information. The dataset is primarily used for training sets, covering varying amounts of bytes and examples. These datasets may be used for multimodal tasks, combining image and text dialogue data.
提供机构:
xDAN-Vision
原始信息汇总
数据集概述
数据集配置
ShareGPT-4o
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 175154584
- num_examples: 57289
- train:
- 下载大小: 102869620
- 数据集大小: 175154584
ai2d
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 2952877
- num_examples: 4060
- train:
- 下载大小: 898466
- 数据集大小: 2952877
allava0
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 42963289
- num_examples: 43946
- train:
- 下载大小: 20510657
- 数据集大小: 42963289
allava1
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 42877172
- num_examples: 43946
- train:
- 下载大小: 20524162
- 数据集大小: 42877172
allava10
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 42964031
- num_examples: 43946
- train:
- 下载大小: 20457633
- 数据集大小: 42964031
allava11
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 43135344
- num_examples: 43946
- train:
- 下载大小: 20610439
- 数据集大小: 43135344
allava12
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 43001334
- num_examples: 43945
- train:
- 下载大小: 20505843
- 数据集大小: 43001334
allava13
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 42689525
- num_examples: 43945
- train:
- 下载大小: 20405278
- 数据集大小: 42689525
allava14
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 42999796
- num_examples: 43945
- train:
- 下载大小: 20496264
- 数据集大小: 42999796
allava2
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 42844068
- num_examples: 43946
- train:
- 下载大小: 20458312
- 数据集大小: 42844068
allava3
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 42799435
- num_examples: 43946
- train:
- 下载大小: 20443667
- 数据集大小: 42799435
allava4
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 42892737
- num_examples: 43946
- train:
- 下载大小: 20495515
- 数据集大小: 42892737
allava5
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 42973849
- num_examples: 43946
- train:
- 下载大小: 20570964
- 数据集大小: 42973849
allava6
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 42820568
- num_examples: 43946
- train:
- 下载大小: 20418922
- 数据集大小: 42820568
allava7
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 42799180
- num_examples: 43946
- train:
- 下载大小: 20423888
- 数据集大小: 42799180
allava8
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 42788182
- num_examples: 43946
- train:
- 下载大小: 20451954
- 数据集大小: 42788182
allava9
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 42956182
- num_examples: 43946
- train:
- 下载大小: 20511079
- 数据集大小: 42956182
arxivqa
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 65697170
- num_examples: 54399
- train:
- 下载大小: 26608192
- 数据集大小: 65697170
chartqa
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 6385441
- num_examples: 18317
- train:
- 下载大小: 2292993
- 数据集大小: 6385441
design2code
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 38299768
- num_examples: 484
- train:
- 下载大小: 10815153
- 数据集大小: 38299768
docvqa
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 6074591
- num_examples: 10194
- train:
- 下载大小: 2078997
- 数据集大小: 6074591
dvqa
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 133124289
- num_examples: 197071
- train:
- 下载大小: 24090659
- 数据集大小: 133124289
gpt4v-dataset
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 25346678
- num_examples: 11474
- train:
- 下载大小: 7215982
- 数据集大小: 25346678
llavar
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 12626164
- num_examples: 19800
- train:
- 下载大小: 6517180
- 数据集大小: 12626164
ocr_vqa
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 41129279
- num_examples: 80000
- train:
- 下载大小: 9610901
- 数据集大小: 41129279
screen_qa
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 11253735
- num_examples: 33161
- train:
- 下载大小: 4316588
- 数据集大小: 11253735
share_textvqa
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 854317
- num_examples: 500
- train:
- 下载大小: 421004
- 数据集大小: 854317
synthdog0
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 51702342
- num_examples: 100000
- train:
- 下载大小: 31034309
- 数据集大小: 51702342
synthdog1
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 51808890
- num_examples: 100000
- train:
- 下载大小: 31210694
- 数据集大小: 51808890
synthdog2
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 51747705
- num_examples: 100000
- train:
- 下载大小: 31073201
- 数据集大小: 51747705
synthdog3
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 51760959
- num_examples: 100000
- train:
- 下载大小: 31087012
- 数据集大小: 51760959
synthdog4
- 特征:
- id: string
- images:
- bytes: binary
- path: string
- conversation:
- role: string
- content: string
- source: string
- 分割:
- train:
- num_bytes: 51816065
- num_examples: 100000
- train:
- 下载大小: 31076731
- 数据集大小: 51816065
websight0
- 特征:
- id: string



