VQA v1

arXiv2025-09-30 收录

下载链接：

https://eval.ai/web/challenges/challenge-page/830/overview

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集是首个大规模、开放式的自由形式数据集，旨在评估视觉问答（VQA）方法的能力，如细粒度检测、识别和计数。它包含了614.2万个问题和204.7万张图片。此外，该数据集为问题提供了多个真实答案，并分为训练、验证、测试开发、标准测试、挑战测试和保留测试等多个部分。其规模属于大型，任务类型为视觉问答。

This dataset is the first large-scale, open-domain free-form dataset designed to evaluate the capabilities of Visual Question Answering (VQA) methods, such as fine-grained detection, recognition and counting. It contains 6.142 million questions and 2.047 million images. Additionally, the dataset provides multiple ground-truth answers for each question, and is divided into multiple subsets including training, validation, test-dev, standard test, challenge test and holdout test. As a large-scale dataset, its targeted task is Visual Question Answering.

搜集汇总

数据集介绍

背景与挑战

背景概述

VQA v1是Visual Question Answering Challenge的第一版数据集，用于评估AI系统在回答关于图像的开放性问题上的能力。VQA v2.0是其改进版本，减少了语言偏见并增加了数据量。

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集