five

VietTravelVQA

收藏
Mendeley Data2026-04-18 收录
下载链接:
https://data.mendeley.com/datasets/fvxtpnh8mh
下载链接
链接失效反馈
官方服务:
资源简介:
Objective: This dataset is designed to benchmark and improve Visual Question Answering (VQA) systems in the context of Vietnamese tourism and cultural heritage. It addresses the lack of high-quality, regionally specific multimodal data for Southeast Asia. Data Content: The dataset comprises thousands of images sourced from Wikimedia Commons, paired with human-verified question-answer sets in Vietnamese. The questions cover five levels of complexity, ranging from basic object identification to deep cultural reasoning. Methodology: 1. Sourcing: Legally compliant images were filtered from Wikimedia Commons. 2. Annotation: Expert annotators generated QA pairs, focusing on architectural details, historical significance, and spatial reasoning. 3. Validation: Data was cleaned using automated scripts to ensure 100% synchronization between metadata (JSON) and image files, with factual auditing via Large Multimodal Models (LMMs). Usage: The data is split into train and test sets (.json). It is intended for training, fine-tuning, and evaluating Vision-Language Models (VLMs) on localized cultural contexts.
创建时间:
2026-03-16
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作