JosselinSom/Latex-VLM

Hugging Face, updated 2024-01-20, indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/JosselinSom/Latex-VLM
Resource description:
---
language:
- en
license: apache-2.0
size_categories:
- 1K<n<10K
task_categories:
- question-answering
- visual-question-answering
pretty_name: vlm-latex
dataset_info:
- config_name: default
  features:
  - name: id
    dtype: int64
  - name: tex_code
    dtype: string
  - name: category
    dtype: string
  - name: subject
    dtype: string
  - name: output
    dtype: image
  - name: asset_0
    dtype: string
  - name: asset_1
    dtype: string
  - name: asset_2
    dtype: string
  - name: asset_3
    dtype: string
  - name: asset_4
    dtype: string
  - name: asset_5
    dtype: string
  - name: asset_6
    dtype: string
  - name: asset_7
    dtype: string
  - name: asset_8
    dtype: string
  - name: asset_9
    dtype: string
  - name: __index_level_0__
    dtype: int64
  splits:
  - name: train
    num_bytes: 11180029.0
    num_examples: 382
  - name: validation
    num_bytes: 2508901.0
    num_examples: 96
  download_size: 13084804
  dataset_size: 13688930.0
- config_name: equation
  features:
  - name: id
    dtype: int64
  - name: tex_code
    dtype: string
  - name: category
    dtype: string
  - name: subject
    dtype: string
  - name: asset_1
    dtype: string
  - name: asset_2
    dtype: string
  - name: asset_3
    dtype: string
  - name: asset_4
    dtype: string
  - name: asset_5
    dtype: string
  - name: asset_6
    dtype: string
  - name: output
    dtype: image
  splits:
  - name: train
    num_bytes: 10563678
    num_examples: 783
  - name: validation
    num_bytes: 2888346
    num_examples: 196
  download_size: 13355195
  dataset_size: 13452024
- config_name: figure
  features:
  - name: id
    dtype: int64
  - name: tex_code
    dtype: string
  - name: category
    dtype: string
  - name: subject
    dtype: string
  - name: asset_1
    dtype: string
  - name: asset_2
    dtype: string
  - name: asset_3
    dtype: string
  - name: asset_4
    dtype: string
  - name: asset_5
    dtype: string
  - name: asset_6
    dtype: string
  - name: output
    dtype: image
  splits:
  - name: train
    num_bytes: 10563678
    num_examples: 783
  - name: validation
    num_bytes: 2888346
    num_examples: 196
  download_size: 13355195
  dataset_size: 13452024
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: validation
    path: data/validation-*
- config_name: equation
  data_files:
  - split: train
    path: equation/train-*
  - split: validation
    path: equation/validation-*
- config_name: figure
  data_files:
  - split: train
    path: equation/train-*
  - split: validation
    path: equation/validation-*
tags:
- biology
- finance
- economics
- math
- physics
- computer_science
- electronics
- statistics
---
Provided by:
JosselinSom
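Summary of original information

Dataset overview

Basic information

  • Language: English
  • License: Apache 2.0
  • Size category: 1K<n<10K
  • Task categories:
    • Question answering
    • Visual question answering
  • Dataset name: vlm-latex

Dataset configurations

  • Config name: default

    • Features: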
      • id: int64
      • tex_code: string
      • category: string
      • subject: string
      • output: image
      • asset_0: string
      • asset_1: string
      • asset_2: string
      • asset_3: string
      • asset_4: string
      • asset_5: string
      • asset_6: string
      • asset_7: string
      • asset_8: string
      • asset_9: string
      • __index_level_0__: int64
    • Splits:
      • train: 382 examples, 11180029 bytes
      • validation: 96 examples, 2508901 bytes
    • Download size: 13084804 bytes
    • Dataset size: 13688930 bytes
  • Config name: equation

    • Features:
      • id: int64
      • tex_code: string
      • category: string
      • subject: string
      • asset_1: string
      • asset_2: string
      • asset_3: string
      • asset_4: string
      • asset_5: string
      • asset_6: string
      • output: image
    • Splits:
      • train: 783 examples, 10563678 bytes
      • validation: 196 examples, 2888346 bytes
    • Download size: 13355195 bytes
    • Dataset size: 13452024 bytes
  • Config name: figure

    • Features:
      • id: int64
      • tex_code: string
      • category: string
      • subject: string
      • asset_1: string
      • asset_2: string
      • asset_3: string
      • asset_4: string
      • asset_5: string
      • asset_6: string
      • output: image
    • Splits:
      • train: 783 examples, 10563678 bytes
      • validation: 196 examples, 2888346 bytes
    • Download size: 13355195 bytes
    • Dataset size: 13452024 bytes
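
The split figures reported for the three configs can be cross-checked with a little arithmetic. The sketch below is only an illustration: the counts and byte sizes are copied from the listing, while the names `SPLITS`, `DATASET_SIZE`, `train_fraction` and `bytes_match` are hypothetical helpers introduced here, not part of the dataset or any library.

```python
# Split statistics copied from the configuration listing:
# config -> split -> (number of examples, number of bytes).
SPLITS = {
    "default": {"train": (382, 11180029), "validation": (96, 2508901)},
    "equation": {"train": (783, 10563678), "validation": (196, 2888346)},
    "figure": {"train": (783, 10563678), "validation": (196, 2888346)},
}

# Reported dataset size in bytes for each config.
DATASET_SIZE = {"default": 13688930, "equation": 13452024, "figure": 13452024}


def train_fraction(config: str) -> float:
    """Share of a config's examples that fall in its train split."""
    counts = {name: n for name, (n, _) in SPLITS[config].items()}
    return counts["train"] / sum(counts.values())


def bytes_match(config: str) -> bool:
    """True if the split byte counts add up to the reported dataset size."""
    total = sum(num_bytes for _, num_bytes in SPLITS[config].values())
    return total == DATASET_SIZE[config]


for config in SPLITS:
    print(config, f"{train_fraction(config):.3f}", bytes_match(config))
```

On the reported numbers, every config comes out at roughly an 80/20 train/validation split, and the per-split byte counts sum to the stated dataset size.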

Data file configuration

  • Config name: default

    • Data files:
      • train: data/train-*
      • validation: data/validation-*
  • Config name: equation

    • Data files:
      • train: equation/train-*
      • validation: equation/validation-*
  • Config name: figure

    • Data files:
      • train: equation/train-*
      • validation: equation/validation-*
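
As a rough sketch of how these config and split names might be used, the snippet below wraps `load_dataset` from the Hugging Face `datasets` library. It assumes that library is installed and that the Hub repository is reachable. `REPO_ID`, `CONFIGS` and `load_latex_vlm` are hypothetical names chosen for this example, and the file patterns are copied from the listing rather than checked against the repository.

```python
REPO_ID = "JosselinSom/Latex-VLM"

# Config -> split -> file pattern, exactly as listed in the data file configuration.
CONFIGS = {
    "default": {"train": "data/train-*", "validation": "data/validation-*"},
    "equation": {"train": "equation/train-*", "validation": "equation/validation-*"},
    "figure": {"train": "equation/train-*", "validation": "equation/validation-*"},
}


def load_latex_vlm(config: str = "default", split: str = "train"):
    """Load one split of one config, rejecting names the listing does not mention."""
    if config not in CONFIGS:
        raise ValueError(f"unknown config: {config!r}")
    if split not in CONFIGS[config]:
        raise ValueError(f"unknown split for {config!r}: {split!r}")
    # Imported lazily so the name checks above work without the library installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, config, split=split)
```

A call such as `load_latex_vlm("equation", "validation")` would request the validation split of the equation config. Note that, as listed, the figure config points at the same `equation/` file patterns as the equation config. The listing does not say whether that is intentional.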

Tags

  • biology
  • finance
  • economics
  • math
  • physics
  • computer_science
  • electronics
  • statistics