docvqa_test_subsampled
收藏魔搭社区2025-11-27 更新2025-06-07 收录
下载链接:
https://modelscope.cn/datasets/vidore/docvqa_test_subsampled
下载链接
链接失效反馈官方服务:
资源简介:
## Dataset Description
This is the test set taken from the [DocVQA dataset](https://www.docvqa.org/datasets/docvqa). It includes collected images from the UCSF Industry Documents Library. Questions and answers were manually annotated.
Example of data (see viewer)
### Data Curation
To ensure homogeneity across our benchmarked datasets, we subsampled the original test set to 500 pairs and renamed the different columns.
### Load the dataset
```python
from datasets import load_dataset
ds = load_dataset("vidore/docvqa_test_subsampled", split="test")
```
### Dataset Structure
Here is an example of a dataset instance structure:
```json
features:
- name: questionId
dtype: string
- name: query
dtype: string
- name: question_types
dtype: 'null'
- name: image
dtype: image
- name: docId
dtype: int64
- name: image_filename
dtype: string
- name: page
dtype: string
- name: answer
dtype: 'null'
- name: data_split
dtype: string
- name: source
dtype: string
```
## Citation Information
If you use this dataset in your research, please cite the original dataset as follows:
```latex
@article{mathew_docvqa_2020,
title = {{DocVQA}: {A} {Dataset} for {VQA} on {Document} {Images}},
copyright = {arXiv.org perpetual, non-exclusive license},
shorttitle = {{DocVQA}},
url = {https://arxiv.org/abs/2007.00398},
doi = {10.48550/ARXIV.2007.00398},
urldate = {2024-05-17},
author = {Mathew, Minesh and Karatzas, Dimosthenis and Jawahar, C. V.},
year = {2020},
keywords = {FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Information Retrieval (cs.IR)},
}
```
## 数据集说明
本数据集为取自[DocVQA数据集(DocVQA dataset)](https://www.docvqa.org/datasets/docvqa)的测试集。其图像资源采集自UCSF行业文献库(UCSF Industry Documents Library),问答对均经人工标注。
数据示例(详见查看器)
### 数据整理
为确保基准数据集间的同质性,我们将原始测试集下采样至500组问答对,并对各列进行了重命名。
### 数据集加载
python
from datasets import load_dataset
ds = load_dataset("vidore/docvqa_test_subsampled", split="test")
### 数据集结构
以下为数据集实例的结构示例:
json
features:
- name: questionId
dtype: string
- name: query
dtype: string
- name: question_types
dtype: 'null'
- name: image
dtype: image
- name: docId
dtype: int64
- name: image_filename
dtype: string
- name: page
dtype: string
- name: answer
dtype: 'null'
- name: data_split
dtype: string
- name: source
dtype: string
### 引用信息
若您在研究中使用本数据集,请按以下格式引用原始数据集:
latex
@article{mathew_docvqa_2020,
title = {{DocVQA}: {A} {Dataset} for {VQA} on {Document} {Images}},
copyright = {arXiv.org perpetual, non-exclusive license},
shorttitle = {{DocVQA}},
url = {https://arxiv.org/abs/2007.00398},
doi = {10.48550/ARXIV.2007.00398},
urldate = {2024-05-17},
author = {Mathew, Minesh and Karatzas, Dimosthenis and Jawahar, C. V.},
year = {2020},
keywords = {FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Information Retrieval (cs.IR)},
}
提供机构:
maas
创建时间:
2025-06-04



