JosselinSom/Latex-VLM
收藏Hugging Face2024-01-20 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/JosselinSom/Latex-VLM
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- en
license: apache-2.0
size_categories:
- 1K<n<10K
task_categories:
- question-answering
- visual-question-answering
pretty_name: vlm-latex
dataset_info:
- config_name: default
features:
- name: id
dtype: int64
- name: tex_code
dtype: string
- name: category
dtype: string
- name: subject
dtype: string
- name: output
dtype: image
- name: asset_0
dtype: string
- name: asset_1
dtype: string
- name: asset_2
dtype: string
- name: asset_3
dtype: string
- name: asset_4
dtype: string
- name: asset_5
dtype: string
- name: asset_6
dtype: string
- name: asset_7
dtype: string
- name: asset_8
dtype: string
- name: asset_9
dtype: string
- name: __index_level_0__
dtype: int64
splits:
- name: train
num_bytes: 11180029.0
num_examples: 382
- name: validation
num_bytes: 2508901.0
num_examples: 96
download_size: 13084804
dataset_size: 13688930.0
- config_name: equation
features:
- name: id
dtype: int64
- name: tex_code
dtype: string
- name: category
dtype: string
- name: subject
dtype: string
- name: asset_1
dtype: string
- name: asset_2
dtype: string
- name: asset_3
dtype: string
- name: asset_4
dtype: string
- name: asset_5
dtype: string
- name: asset_6
dtype: string
- name: output
dtype: image
splits:
- name: train
num_bytes: 10563678
num_examples: 783
- name: validation
num_bytes: 2888346
num_examples: 196
download_size: 13355195
dataset_size: 13452024
- config_name: figure
features:
- name: id
dtype: int64
- name: tex_code
dtype: string
- name: category
dtype: string
- name: subject
dtype: string
- name: asset_1
dtype: string
- name: asset_2
dtype: string
- name: asset_3
dtype: string
- name: asset_4
dtype: string
- name: asset_5
dtype: string
- name: asset_6
dtype: string
- name: output
dtype: image
splits:
- name: train
num_bytes: 10563678
num_examples: 783
- name: validation
num_bytes: 2888346
num_examples: 196
download_size: 13355195
dataset_size: 13452024
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: validation
path: data/validation-*
- config_name: equation
data_files:
- split: train
path: equation/train-*
- split: validation
path: equation/validation-*
- config_name: figure
data_files:
- split: train
path: equation/train-*
- split: validation
path: equation/validation-*
tags:
- biology
- finance
- economics
- math
- physics
- computer_science
- electronics
- statistics
---
提供机构:
JosselinSom
原始信息汇总
数据集概述
基本信息
- 语言: 英语
- 许可证: Apache 2.0
- 数据集大小: 1K<n<10K
- 任务类别:
- 问答
- 视觉问答
- 数据集名称: vlm-latex
数据集配置
-
配置名称: default
- 特征:
- id: int64
- tex_code: string
- category: string
- subject: string
- output: image
- asset_0: string
- asset_1: string
- asset_2: string
- asset_3: string
- asset_4: string
- asset_5: string
- asset_6: string
- asset_7: string
- asset_8: string
- asset_9: string
- index_level_0: int64
- 分割:
- train: 382个样本, 11180029字节
- validation: 96个样本, 2508901字节
- 下载大小: 13084804字节
- 数据集大小: 13688930字节
- 特征:
-
配置名称: equation
- 特征:
- id: int64
- tex_code: string
- category: string
- subject: string
- asset_1: string
- asset_2: string
- asset_3: string
- asset_4: string
- asset_5: string
- asset_6: string
- output: image
- 分割:
- train: 783个样本, 10563678字节
- validation: 196个样本, 2888346字节
- 下载大小: 13355195字节
- 数据集大小: 13452024字节
- 特征:
-
配置名称: figure
- 特征:
- id: int64
- tex_code: string
- category: string
- subject: string
- asset_1: string
- asset_2: string
- asset_3: string
- asset_4: string
- asset_5: string
- asset_6: string
- output: image
- 分割:
- train: 783个样本, 10563678字节
- validation: 196个样本, 2888346字节
- 下载大小: 13355195字节
- 数据集大小: 13452024字节
- 特征:
数据文件配置
-
配置名称: default
- 数据文件:
- train: data/train-*
- validation: data/validation-*
- 数据文件:
-
配置名称: equation
- 数据文件:
- train: equation/train-*
- validation: equation/validation-*
- 数据文件:
-
配置名称: figure
- 数据文件:
- train: equation/train-*
- validation: equation/validation-*
- 数据文件:
标签
- biology
- finance
- economics
- math
- physics
- computer_science
- electronics
- statistics



