infovqa_test_subsampled
Source: ModelScope community. Updated 2026-01-06. Indexed 2025-06-07.
Download link:
https://modelscope.cn/datasets/vidore/infovqa_test_subsampled
## Dataset Description
This is the test set taken from the [InfoVQA dataset](https://www.docvqa.org/datasets/infographicvqa). It includes infographics collected from the Internet using the search query “*infographics*”. Questions and answers were manually annotated.
Example of data: see the dataset viewer.
### Data Curation
To ensure homogeneity across our benchmarked datasets, we subsampled the original test set to 500 pairs and renamed the different columns.
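The curation step can be sketched in plain Python. This is a minimal illustration, not the authors' actual script: the sampling method, the seed, and the original column names (`question`, `image_name`) are assumptions. Only the target size of 500 and the renamed columns `query` and `image_filename` come from this card.

```python
import random

# Hypothetical mapping from original column names to the renamed ones.
# Only the target names appear in this card; the source names are assumed.
RENAME_MAP = {"question": "query", "image_name": "image_filename"}


def subsample_and_rename(rows, n=500, seed=42, rename_map=RENAME_MAP):
    """Draw up to n rows without replacement and rename their columns."""
    rng = random.Random(seed)  # fixed seed so the subsample is reproducible
    picked = rng.sample(rows, min(n, len(rows)))
    return [
        {rename_map.get(key, key): value for key, value in row.items()}
        for row in picked
    ]


# Toy stand-in for the original test set (not real data).
toy_rows = [
    {"questionId": str(i), "question": f"q{i}", "image_name": f"{i}.jpeg"}
    for i in range(2000)
]
subset = subsample_and_rename(toy_rows)
print(len(subset), sorted(subset[0]))
```

Fixing the seed keeps the subsample stable across runs, which matters when several models are compared on the same 500 pairs.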
### Load the dataset
```python
from datasets import load_dataset
ds = load_dataset("vidore/infovqa_test_subsampled", split="test")
```
### Dataset Structure
Here is an example of a dataset instance structure:
```yaml
features:
  - name: questionId
    dtype: string
  - name: query
    dtype: string
  - name: answer
    dtype: 'null'
  - name: answer_type
    dtype: 'null'
  - name: image
    dtype: image
  - name: image_filename
    dtype: string
  - name: operation/reasoning
    dtype: 'null'
  - name: ocr
    dtype: string
  - name: data_split
    dtype: string
  - name: source
    dtype: string
```
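A loaded record can be sanity-checked against this schema with a few lines of Python. The sketch below is only illustrative: the helper `check_record` and the toy record are hypothetical, while the field names and the split into string-typed and null-typed fields follow the feature list. The `image` field is only checked for presence, since its decoded type depends on the loader.

```python
STRING_FIELDS = ["questionId", "query", "image_filename", "ocr", "data_split", "source"]
NULL_FIELDS = ["answer", "answer_type", "operation/reasoning"]


def check_record(record):
    """Return a list of schema problems found in one record (empty if none)."""
    problems = []
    for name in STRING_FIELDS:
        if not isinstance(record.get(name), str):
            problems.append(f"{name}: expected a string")
    for name in NULL_FIELDS:
        # dtype 'null' means the column holds no values in this split
        if name not in record:
            problems.append(f"{name}: missing")
        elif record[name] is not None:
            problems.append(f"{name}: expected None")
    if "image" not in record:
        problems.append("image: missing")
    return problems


# Toy record with placeholder values (not taken from the dataset).
example = {
    "questionId": "0",
    "query": "placeholder question",
    "answer": None,
    "answer_type": None,
    "image": object(),  # stands in for a decoded image
    "image_filename": "placeholder.jpeg",
    "operation/reasoning": None,
    "ocr": "placeholder ocr text",
    "data_split": "test",
    "source": "placeholder",
}
print(check_record(example))
```

Because `answer`, `answer_type`, and `operation/reasoning` are null-typed, code that consumes this split should not rely on them carrying values.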
## Citation Information
If you use this dataset in your research, please cite the original dataset as follows:
```bibtex
@misc{mathew_infographicvqa_2021,
  title = {{InfographicVQA}},
  copyright = {arXiv.org perpetual, non-exclusive license},
  url = {https://arxiv.org/abs/2104.12756},
  doi = {10.48550/ARXIV.2104.12756},
  urldate = {2024-06-02},
  publisher = {arXiv},
  author = {Mathew, Minesh and Bagal, Viraj and Tito, Rubèn Pérez and Karatzas, Dimosthenis and Valveny, Ernest and Jawahar, C. V},
  year = {2021},
  note = {Version Number: 2},
  keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV)},
}
```
Provider: maas
Created: 2025-06-04



