five

A Priority Map for Vision-Language Navigation - Datasets

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/6891964
下载链接
链接失效反馈
官方服务:
资源简介:
This archive contains full versions of the datasets and additional data presented in the following paper: A Priority Map for Vision-and-Language Navigation with Trajectory Plans and Feature-Location Cues A priority map module (PM-VLN) boosts the performance of transformer-based architectures in navigation tasks by combining temporal sequence alignment and feature-level localisation in cross-modal inputs. The module is pretrained on trajectory estimation and a multi-objective task that pairs location estimation with cross-modal sentence prediction. Two datasets are introduced for the auxiliary tasks:  - TR-NY-PIT-central - a set of path traces for routes in two urban locations.  - MC-10 - a set of samples with multimodal inputs representing landmarks in 10 US cities. Full details and links for this research are available at the following link: https://jasonarmitage-res.github.io/projects/priority_map/ Additional data comprising path traces for routes in Manhattan and language tokens for the Touchdown task are provided for training and evaluating the PM-VLN and framework on the Touchdown benchmark. Please refer to the following link for details on the Touchdown dataset and StreetLearn environment: https://sites.google.com/view/streetlearn/touchdown
创建时间:
2024-07-16
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作