Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras

Name: Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras
Creator: University of Edinburgh. School of Informatics
Published: 2024-07-30 16:05:07
License: 暂无描述

DataCite Commons2024-07-30 更新2025-04-17 收录

下载链接：

https://datashare.ed.ac.uk/handle/10283/8832

下载链接

链接失效反馈

官方服务：

资源简介：

The performance of image-based Reinforcement Learning (RL) agents can vary depending on the position of the camera used to capture the images. Training on multiple cameras simultaneously, including a first-person egocentric camera, can leverage information from different camera perspectives to improve the performance of RL. However, hardware constraints may limit the availability of multiple cameras in real-world deployment. Additionally, cameras may become damaged in the real-world preventing access to all cameras that were used during training. To overcome these hardware constraints, we propose Multi-View Disentanglement (MVD), which uses multiple cameras to learn a policy that is robust to a reduction in the number of cameras to generalise to any single camera from the training set. Our approach is a self-supervised auxiliary task for RL that learns a disentangled representation from multiple cameras, with a shared representation that is aligned across all cameras to allow generalisation to a single camera, and a private representation that is camera-specific. We show experimentally that an RL agent trained on a single third-person camera is unable to learn an optimal policy in many control tasks; but, our approach, benefiting from multiple cameras during training, is able to solve the task using only the same single third-person camera.

提供机构：

University of Edinburgh. School of Informatics

创建时间：

2024-07-29

5,000+

优质数据集

54 个

任务类型

进入经典数据集