Reinforcement Learning for Assessing Route Instruction Usability in Complex Indoor Spaces

Name: Reinforcement Learning for Assessing Route Instruction Usability in Complex Indoor Spaces
Creator: figshare
Published: 2025-05-15 06:39:27
License: 暂无描述

DataCite Commons2025-05-15 更新2025-09-08 收录

下载链接：

https://figshare.com/articles/dataset/Reinforcement_Learning_for_Assessing_Route_Instruction_Usability_in_Complex_Indoor_Spaces/29069243

下载链接

链接失效反馈

官方服务：

资源简介：

Wayfinding in complex indoor spaces is challenging, particularly when route instructions are incomplete or ambiguous. While humans often successfully navigate under such conditions by leveraging past experiences, computationally modeling this adaptive skill remains an open research area. This paper addresses the problem of assessing the usability of incomplete route instructions in unknown indoor environments. The objective is to determine if Reinforcement Learning (RL) agents, trained in diverse settings, can acquire transferable wayfinding skills to navigate effectively despite missing information. We demonstrate a novel computational approach using RL, specifically Proximal Policy Optimization (PPO), to model the acquisition and transfer of wayfinding skills. Agents are trained via a curriculum learning approach in simulated text-based indoor environments of varying complexity and with different turn-based instruction grammars. Their ability to navigate with both complete and incomplete instructions is then evaluated in seen and unseen environments. Our results show that RL agents successfully learn to navigate even with incomplete instructions, significantly outperforming random agents. Agents trained on diverse environments generalize well to novel settings, although performance decreases with higher environmental complexity and finer-grained instruction grammars when instructions are incomplete. Step-cost reward structures yielded better learning outcomes. This research offers a probabilistic framework for quantifying route instruction usability. The framework reframes route evaluation from deterministic error checking to probabilistic risk assessment. It lets designers gauge when `imperfect’ instructions remain usable, optimise wording and grammar, and develop adaptive indoor navigation, evacuation, or accessibility aids. By quantifying skill transfer across layouts, it also provides an experimental testbed for cognitive theories of heuristic learning and spatial knowledge acquisition

提供机构：

figshare

创建时间：

2025-05-15

5,000+

优质数据集

54 个

任务类型

进入经典数据集