bhalladitya/scicap-caption-no-more-than-100-tokens-yes-subfig

Name: bhalladitya/scicap-caption-no-more-than-100-tokens-yes-subfig
Creator: bhalladitya
Published: 2024-07-16 07:15:43
License: 暂无描述

Hugging Face2024-07-16 更新2024-07-22 收录

下载链接：

https://hf-mirror.com/datasets/bhalladitya/scicap-caption-no-more-than-100-tokens-yes-subfig

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含两个主要特征：messages和images。messages特征是一个列表，其中包含content和role两个子特征。content特征进一步包含index、text和type三个子特征。images特征是一个图像序列。数据集分为训练集和测试集，分别包含30个样本，训练集和测试集的大小分别为1736251字节和2021842字节。整个数据集的大小为3758093字节，下载大小为3701264字节。

The dataset contains two main features: messages and images. The messages feature is a list that includes two sub-features: content and role. The content feature further contains three sub-features: index, text, and type. The images feature is a sequence of images. The dataset is divided into a training set and a test set, each containing 30 samples. The sizes of the training set and test set are 1736251 bytes and 2021842 bytes, respectively. The total size of the dataset is 3758093 bytes, and the download size is 3701264 bytes.

提供机构：

bhalladitya

原始信息汇总

数据集概述

数据集结构

features:
- messages:
  - content:
    - index: 数据类型为 int64
    - text: 数据类型为 string
    - type: 数据类型为 string
  - role: 数据类型为 string
- images: 序列类型为 image

数据集分割

splits:
- train:
  - num_bytes: 1736251.0
  - num_examples: 30
- test:
  - num_bytes: 2021842.0
  - num_examples: 30

数据集大小

download_size: 3701264
dataset_size: 3758093.0

配置

configs:
- config_name: default
  - data_files:
    - split: train
      - path: data/train-*
    - split: test
      - path: data/test-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集