Supplementary Material from Searching for a source without gradients: how good is infotaxis and how to beat it

Name: Supplementary Material from Searching for a source without gradients: how good is infotaxis and how to beat it
Creator: The Royal Society
Published: 2022-06-13 14:16:45
License: 暂无描述

DataCite Commons2022-06-13 更新2024-07-29 收录

下载链接：

https://rs.figshare.com/articles/dataset/Supplementary_Material_from_Searching_for_a_source_without_gradients_how_good_is_infotaxis_and_how_to_beat_it/20060550

下载链接

链接失效反馈

官方服务：

资源简介：

Infotaxis is a popular search algorithm designed to track a source of odour in a turbulent environment using information provided by odour detections. To exemplify its capabilities, the source-tracking task was framed as a partially observable Markov decision process consisting in finding, as fast as possible, a stationary target hidden in a two-dimensional grid using stochastic partial observations of the target location. Here, we provide an extended review of infotaxis, together with a toolkit for devising better strategies. We first characterize the performance of infotaxis in domains from one dimension to four dimensions. Our results show that, while being suboptimal, infotaxis is reliable (the probability of not reaching the source approaches zero), efficient (the mean search time scales as expected for the optimal strategy) and safe (the tail of the distribution of search times decays faster than any power law, though subexponentially). We then present three possible ways of beating infotaxis, all inspired by methods used in artificial intelligence: tree search, heuristic approximation of the value function, and deep reinforcement learning. The latter is able to find, without any prior human knowledge, the (near) optimal strategy. Altogether, our results provide evidence that the margin of improvement of infotaxis towards the optimal strategy gets smaller as the dimensionality increases.

提供机构：

The Royal Society

创建时间：

2022-06-13