five

SGN and EQA

收藏
arXiv2025-09-30 收录
下载链接:
https://devendrachaplot.github.io/projects/EMML
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集专注于语言条件下的视觉导航任务,特别是语义目标导航(SGN)和具身问答(EQA)。该数据集旨在评估模型在SGN和EQA任务上的表现,结果表明,双重注意力模型显著优于基线模型。数据集规模包括简单设置下的1000万帧和困难设置下的5000万帧,任务旨在联合学习语义目标导航和具身问答。

This dataset focuses on vision-and-language navigation tasks, specifically Semantic Goal Navigation (SGN) and Embodied Question Answering (EQA). It is designed to evaluate model performance on these two tasks, and the experimental results demonstrate that the dual-attention model significantly outperforms baseline models. The dataset includes 10 million frames under the simple setup and 50 million frames under the challenging setup, with its tasks aiming to jointly learn Semantic Goal Navigation and Embodied Question Answering.
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作