five

Robust VQA (RVQA) Dataset with Language Prior and Compositional Reasoning Labels

收藏
DataCite Commons2024-11-27 更新2025-04-16 收录
下载链接:
https://ieee-dataport.org/documents/robust-vqa-rvqa-dataset-language-prior-and-compositional-reasoning-labels
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset is designed to advance research in Visual Question Answering (VQA), specifically addressing challenges related to language priors and compositional reasoning. It incorporates question labels categorizing queries based on their susceptibility to either issue, allowing for targeted evaluation of VQA models. The dataset consists of 33,051 training images and 14,165 validation images, along with 571,244 training questions and 245,087 validation questions. Among the training questions, 313,664 focus on compositional reasoning, while 257,580 pertain to language prior. Similarly, the validation questions are categorized into 134,313 for compositional reasoning and 110,774 for language prior. This dataset serves as a benchmarking tool for evaluating models' performance across these two challenges, providing insights into areas that require further improvement. The comprehensive dataset preparation process, including image collection, caption generation, prompt creation, QA pair generation, and quality control, is outlined in the accompanying algorithm. The dataset is designed to be extensible to other image sources and can be a valuable resource for researchers focusing on VQA tasks involving complex reasoning.
提供机构:
IEEE DataPort
创建时间:
2024-11-27
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作