VQA-E

arXiv2025-09-30 收录

下载链接：

https://github.com/liqing-ustc/vqa-e

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集是从VQA-2衍生而来的，包含了问题-答案对以及相应的解释。这个数据集对于评估视觉问答任务中解释的质量非常有用，尽管它主要支持单字答案的情景。该数据集的任务是视觉问题解答。

This dataset is derived from VQA-v2, which includes question-answer pairs and their corresponding explanations. It is highly useful for evaluating the quality of explanations in visual question answering (VQA) tasks, though it primarily supports scenarios with single-word answers. The core task of this dataset is visual question answering.

搜集汇总

数据集介绍

背景与挑战

背景概述

VQA-E是一个视觉问答解释数据集，包含COCO图像及对应的问答解释数据，用于增强视觉问答系统的解释能力。数据集采用JSON格式存储，包含问题、答案及解释信息，相关研究发表在ECCV 2018上。

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集