looppayments/tables_qa_token_classification_dataset
Hugging Face | Updated: 2023-12-13 | Indexed: 2024-03-04
Download link:
https://hf-mirror.com/datasets/looppayments/tables_qa_token_classification_dataset
Resource description:
---
dataset_info:
  features:
  - name: pixel_values
    dtype:
      array3_d:
        shape:
        - 3
        - 224
        - 224
        dtype: float32
  - name: input_ids
    sequence: int64
  - name: attention_mask
    sequence: int64
  - name: tokens
    sequence: string
  - name: question
    dtype: string
  - name: bbox
    dtype:
      array2_d:
        shape:
        - 512
        - 4
        dtype: int64
  - name: labels
    sequence: int64
  - name: artifact_qid
    dtype: string
  splits:
  - name: train
    num_bytes: 45728183928.665344
    num_examples: 71536
  - name: test
    num_bytes: 11432685215.334654
    num_examples: 17885
  download_size: 2369838681
  dataset_size: 57160869144.0
---
# Dataset Card for "tables_qa_token_classification_dataset"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
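The card itself gives no usage instructions. As a minimal sketch, assuming the repository is still publicly readable on the Hugging Face Hub and that the `datasets` library is installed, the splits listed in the metadata could be streamed rather than downloaded in full, which matters given the roughly 57 GB prepared size. The helper names `stream_split` and `peek` are hypothetical and not part of the dataset or of any library.

```python
# Repository ID and split names are taken from the card metadata above.
REPO_ID = "looppayments/tables_qa_token_classification_dataset"
SPLITS = ("train", "test")


def stream_split(split="train"):
    """Return a streaming iterable over one split, avoiding a full download."""
    if split not in SPLITS:
        raise ValueError(f"unknown split {split!r}; expected one of {SPLITS}")
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=split, streaming=True)


def peek(split="train"):
    """Fetch the first example and return its column names (needs network access)."""
    first = next(iter(stream_split(split)))
    return sorted(first.keys())
```

Calling `peek("train")` should list the eight feature names from the metadata if the repository is reachable; this has not been verified here.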
Provider:
looppayments
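Summary of original information
Dataset overview
Dataset information
- Feature list:
  - pixel_values: 3-D array, shape (3, 224, 224), dtype float32.
  - input_ids: sequence, dtype int64.
  - attention_mask: sequence, dtype int64.
  - tokens: sequence, dtype string.
  - question: string.
  - bbox: 2-D array, shape (512, 4), dtype int64.
  - labels: sequence, dtype int64.
  - artifact_qid: string.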
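The feature list above can be turned into a quick sanity check on a single example. This is a sketch only: `shape_of` and `check_example` are hypothetical helpers, and the code assumes each example arrives as a plain dict of nested Python lists (or of objects exposing a `.shape` attribute, such as NumPy arrays). The source does not state the sequence lengths, so only the two fixed shapes are enforced.

```python
EXPECTED_SHAPES = {"pixel_values": (3, 224, 224), "bbox": (512, 4)}
SEQUENCE_FIELDS = ("input_ids", "attention_mask", "tokens", "labels")
STRING_FIELDS = ("question", "artifact_qid")


def shape_of(value):
    """Return the shape of an array-like value.

    For nested lists only the first element at each level is inspected,
    so ragged inputs are not detected.
    """
    if hasattr(value, "shape"):
        return tuple(value.shape)
    shape = []
    while isinstance(value, (list, tuple)):
        shape.append(len(value))
        if not value:
            break
        value = value[0]
    return tuple(shape)


def check_example(example):
    """Return a list of problems; an empty list means the example matches."""
    problems = []
    for name, expected in EXPECTED_SHAPES.items():
        if name not in example:
            problems.append(f"{name}: missing")
        elif shape_of(example[name]) != expected:
            problems.append(
                f"{name}: shape {shape_of(example[name])}, expected {expected}"
            )
    for name in SEQUENCE_FIELDS:
        # Sequences need at least one dimension; their length is not specified.
        if name not in example or len(shape_of(example[name])) < 1:
            problems.append(f"{name}: missing or not a sequence")
    for name in STRING_FIELDS:
        if not isinstance(example.get(name), str):
            problems.append(f"{name}: missing or not a string")
    return problems
```

An example that passes returns `[]`; each mismatch adds one message naming the offending field.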
Data splits
- Training set:
  - Bytes: 45728183928.665344
  - Examples: 71536
- Test set:
  - Bytes: 11432685215.334654
  - Examples: 17885
Dataset size
- Download size: 2369838681 bytes
- Total dataset size: 57160869144.0 bytes
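The figures above imply a few derived quantities worth making explicit. The sketch below only does arithmetic on the reported numbers. It assumes that the total dataset size refers to the uncompressed prepared data while the download size refers to the compressed files, which is the usual Hugging Face convention but is not stated in the source.

```python
# Figures copied from the split and size listings above.
TRAIN_BYTES = 45728183928.665344
TEST_BYTES = 11432685215.334654
TRAIN_EXAMPLES = 71536
TEST_EXAMPLES = 17885
DOWNLOAD_BYTES = 2369838681
DATASET_BYTES = 57160869144.0

total_examples = TRAIN_EXAMPLES + TEST_EXAMPLES      # 89421
train_fraction = TRAIN_EXAMPLES / total_examples     # about 0.80, an 80/20 split
bytes_per_example = DATASET_BYTES / total_examples   # about 639 KB per example
size_ratio = DATASET_BYTES / DOWNLOAD_BYTES          # about 24x larger once prepared

# Storage taken by the two fixed-shape features in every example.
pixel_bytes = 3 * 224 * 224 * 4   # float32 values -> 602112 bytes
bbox_bytes = 512 * 4 * 8          # int64 values   -> 16384 bytes

if __name__ == "__main__":
    print(f"examples: {total_examples}, train fraction: {train_fraction:.3f}")
    print(f"bytes per example: {bytes_per_example:,.0f}")
    print(f"pixel_values share: {pixel_bytes / bytes_per_example:.1%}")
    print(f"prepared/download size ratio: {size_ratio:.1f}")
```

On these numbers the `pixel_values` array accounts for most of each example's size, so dropping that column where it is not needed would be the main lever for reducing memory use.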



