Visual Question Answering as Reading Comprehension
收藏DataCite Commons2026-01-07 更新2025-04-16 收录
下载链接:
https://service.tib.eu/ldmservice/dataset/7204ab34-fb83-4ffd-a751-22ecf55ec142
下载链接
链接失效反馈官方服务:
资源简介:
Visual question answering (VQA) demands simultaneous comprehension of both the image visual content and natural language questions. In some cases, the reasoning needs the help of common sense or general knowledge which usually appear in the form of text.
提供机构:
TIB
创建时间:
2024-12-16



