Vision-and-Language Parking Dataset
Source: Science Data Bank. Updated: 2025-05-20. Indexed: 2026-04-23.
Download link:
https://www.scidb.cn/detail?dataSetId=c0a8693dbdeb4adfb2829e12f6b1a4de
Description:
Autonomous parking is the final stage of autonomous driving applications. However, current Autonomous Valet Parking (AVP) technology still relies on human-in-the-loop decision-making to select parking spots, owing to limitations in scene understanding and challenges in multimodal data fusion. This paper frames autonomous parking decision-making as a Vision-and-Language Navigation (VLN) problem, in which an autonomous vehicle makes decisions based on visual perception and interpretation of user commands in parking lots. By formalizing the static object features of typical parking lots, including locations, obstacles, and attributes, this paper introduces the Vision-and-Language Parking (VLP) dataset. It comprises 174 onboard panoramic images and 11,310 items of structured information collected from real parking environments at six different time points and in two structures, along with more than 300 instructions, which the authors present as the first vision navigation dataset driven by user natural-language instructions.
Provider:
PENGYU FU
Created:
2024-09-05



