HRVQA dataset underlying the PhD thesis of Kun Li: Interactive Vision-Languag...

Name: HRVQA dataset underlying the PhD thesis of Kun Li: Interactive Vision-Languag...
Creator: 暂无描述
License: 暂无描述

B2FIND2026-03-21 收录

下载链接：

https://b2find.eudat.eu/dataset/433a6ec3-9b46-5fb9-a224-b3e69c0f8d86

下载链接

链接失效反馈

官方服务：

资源简介：

Visual question answering (VQA) is an important and challenging multimodal task in computer vision. We bring VQA task to high-resolution aerial images and propose a large-scale...

5,000+

优质数据集

54 个

任务类型

进入经典数据集