docvqa_test_subsampled

Name: docvqa_test_subsampled
Creator: maas
Published: 2025-11-27 16:36:15
License: No description available

ModelScope community. Updated 2025-11-27; indexed 2025-06-07.

Download link:

https://modelscope.cn/datasets/vidore/docvqa_test_subsampled


Resource description:

## Dataset Description

This is the test set taken from the [DocVQA dataset](https://www.docvqa.org/datasets/docvqa). It includes images collected from the UCSF Industry Documents Library. Questions and answers were manually annotated.

### Data Curation

To ensure homogeneity across our benchmarked datasets, we subsampled the original test set to 500 pairs and renamed the columns.

### Load the dataset

```python
from datasets import load_dataset

ds = load_dataset("vidore/docvqa_test_subsampled", split="test")
```

### Dataset Structure

The dataset declares the following features:

```yaml
features:
- name: questionId
  dtype: string
- name: query
  dtype: string
- name: question_types
  dtype: 'null'
- name: image
  dtype: image
- name: docId
  dtype: int64
- name: image_filename
  dtype: string
- name: page
  dtype: string
- name: answer
  dtype: 'null'
- name: data_split
  dtype: string
- name: source
  dtype: string
```

## Citation Information

If you use this dataset in your research, please cite the original dataset as follows:

```bibtex
@article{mathew_docvqa_2020,
  title = {{DocVQA}: {A} {Dataset} for {VQA} on {Document} {Images}},
  copyright = {arXiv.org perpetual, non-exclusive license},
  shorttitle = {{DocVQA}},
  url = {https://arxiv.org/abs/2007.00398},
  doi = {10.48550/ARXIV.2007.00398},
  urldate = {2024-05-17},
  author = {Mathew, Minesh and Karatzas, Dimosthenis and Jawahar, C. V.},
  year = {2020},
  keywords = {FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Information Retrieval (cs.IR)},
}
```
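As a follow-up to the loading snippet and feature list in the card, the sketch below shows one way to check that a loaded split exposes the listed columns. The helper names `missing_columns` and `check_remote_split` are hypothetical and not part of the dataset or the `datasets` library; the column names are copied from the feature list, and the expected size of 500 comes from the Data Curation note. Treat it as an illustration under those assumptions, not as an official validation script.

```python
from typing import Iterable, List

# Column names copied from the feature list in the dataset card.
EXPECTED_COLUMNS = [
    "questionId", "query", "question_types", "image", "docId",
    "image_filename", "page", "answer", "data_split", "source",
]


def missing_columns(column_names: Iterable[str]) -> List[str]:
    """Return the expected columns that are absent, in card order."""
    present = set(column_names)
    return [name for name in EXPECTED_COLUMNS if name not in present]


def check_remote_split() -> None:
    """Load the test split and print a few sanity checks.

    Hypothetical helper: needs network access and the `datasets` package,
    so the import is kept local to this function.
    """
    from datasets import load_dataset

    ds = load_dataset("vidore/docvqa_test_subsampled", split="test")
    print("rows:", len(ds))  # the card states the split was subsampled to 500 pairs
    print("missing columns:", missing_columns(ds.column_names))
    first = ds[0]
    print("first query:", first["query"])
    print("first image file:", first["image_filename"])
```

`missing_columns` runs offline on any list of column names, while `check_remote_split()` downloads the data when called. Note that the card declares `question_types` and `answer` with dtype `'null'`, so those fields are not expected to hold values in this split.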


Provider:

maas

Created:

2025-06-04
