Vision-and-Language Parking Dataset

Name: Vision-and-Language Parking Dataset
Creator: PENGYU FU
Published: 2025-05-20 00:00:00
License: 暂无描述

科学数据银行2025-05-20 更新2026-04-23 收录

下载链接：

https://www.scidb.cn/detail?dataSetId=c0a8693dbdeb4adfb2829e12f6b1a4de

下载链接

链接失效反馈

官方服务：

资源简介：

Autonomous parking represents the final stage in autonomous driving applications. However, current Autonomous Valet Parking (AVP) technology still relies on human-in-the-loop decision-making for parking spots due to the limitations in scene understanding and challenges in multimodal data fusion. This paper defines autonomous parking decision-making as a Vision-and-Language Navigation (VLN) problem, where an autonomous vehicle makes decisions based on visual perception and user command interpretation in parking lots. By formalizing the static object features of typical parking lots, including locations, obstacles and attributes, this paper introduces the Vision-and-Language Parking (VLP) dataset, featuring 174 onboard panoramic images and 11310 structured information from real parking environments at six different time points and two structures, with more than 300 instructions, marking it as the first vision navigation dataset driven by user natural language instructions.

提供机构：

PENGYU FU

创建时间：

2024-09-05

5,000+

优质数据集

54 个

任务类型

进入经典数据集