five

infovqa_test_subsampled

收藏
魔搭社区2026-01-06 更新2025-06-07 收录
下载链接:
https://modelscope.cn/datasets/vidore/infovqa_test_subsampled
下载链接
链接失效反馈
官方服务:
资源简介:
## Dataset Description This is the test set taken from the [InfoVQA dataset](https://www.docvqa.org/datasets/infographicvqa). includes infographics collected from the Internet using the search query “*infographics*”. Questions and answers were manually annotated. Questions and answers were manually annotated. Example of data : (see viewer) ### Data Curation To ensure homogeneity across our benchmarked datasets, we subsampled the original test set to 500 pairs and renamed the different columns. ### Load the dataset ```python from datasets import load_dataset ds = load_dataset("vidore/infovqa_test_subsampled", split="test") ``` ### Dataset Structure Here is an example of a dataset instance structure: ```xml features: - name: questionId dtype: string - name: query dtype: string - name: answer dtype: 'null' - name: answer_type dtype: 'null' - name: image dtype: image - name: image_filename dtype: string - name: operation/reasoning dtype: 'null' - name: ocr dtype: string - name: data_split dtype: string - name: source dtype: string ``` ## Citation Information If you use this dataset in your research, please cite the original dataset as follows: ```latex @misc{mathew_infographicvqa_2021, title = {{InfographicVQA}}, copyright = {arXiv.org perpetual, non-exclusive license}, url = {https://arxiv.org/abs/2104.12756}, doi = {10.48550/ARXIV.2104.12756}, urldate = {2024-06-02}, publisher = {arXiv}, author = {Mathew, Minesh and Bagal, Viraj and Tito, Rubèn Pérez and Karatzas, Dimosthenis and Valveny, Ernest and Jawahar, C. V}, year = {2021}, note = {Version Number: 2}, keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV)}, } ```

## 数据集描述 本数据集为取自[InfoVQA数据集(InfoVQA dataset)](https://www.docvqa.org/datasets/infographicvqa)的测试集,包含通过搜索关键词"*infographics*"从互联网采集的信息图表。问答对均经人工标注。 数据示例:(请查看查看器) ### 数据整理 为确保本基准测试数据集间的同质性,我们将原始测试集下采样至500组问答对,并对各列进行了重命名。 ### 数据集加载 python from datasets import load_dataset ds = load_dataset("vidore/infovqa_test_subsampled", split="test") ### 数据集结构 以下为数据集实例的结构示例: xml features: - 字段名:问题ID(questionId),数据类型:字符串 - 字段名:查询文本(query),数据类型:字符串 - 字段名:答案(answer),数据类型:空(null) - 字段名:答案类型(answer_type),数据类型:空(null) - 字段名:图像(image),数据类型:图像 - 字段名:图像文件名(image_filename),数据类型:字符串 - 字段名:操作/推理逻辑(operation/reasoning),数据类型:空(null) - 字段名:光学字符识别(Optical Character Recognition,OCR)结果(ocr),数据类型:字符串 - 字段名:数据拆分(data_split),数据类型:字符串 - 字段名:来源(source),数据类型:字符串 ## 引用信息 若您在研究中使用本数据集,请按如下格式引用原始数据集: latex @misc{mathew_infographicvqa_2021, title = {{InfographicVQA}}, copyright = {arXiv.org perpetual, non-exclusive license}, url = {https://arxiv.org/abs/2104.12756}, doi = {10.48550/ARXIV.2104.12756}, urldate = {2024-06-02}, publisher = {arXiv}, author = {Mathew, Minesh and Bagal, Viraj and Tito, Rubèn Pérez and Karatzas, Dimosthenis and Valveny, Ernest and Jawahar, C. V}, year = {2021}, note = {Version Number: 2}, keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV)}, }
提供机构:
maas
创建时间:
2025-06-04
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作