URLB Reinforcement Learning Dataset

HyperAI (超神经) · Updated 2023-01-30 · Indexed 2024-05-15

Download link:

https://hyper.ai/cn/datasets/20612

Overview:

URLB (Unsupervised Reinforcement Learning Benchmark) is a benchmark dataset for unsupervised reinforcement learning. It comprises two phases: a reward-free pre-training phase, followed by a downstream task adaptation phase with extrinsic rewards. Built on the DeepMind Control Suite, it provides 12 continuous control tasks across three domains for evaluation.
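The two-phase protocol described above can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: `ToyEnv`, `RandomAgent`, `run_phase`, and `pretrain_then_finetune` are hypothetical names, the 1-D environment is a stand-in for a real continuous control task, and the step counts are arbitrary. It is not the URLB codebase or the DeepMind Control Suite API; it only shows that the agent sees no extrinsic reward during pre-training and does see it during adaptation.

```python
import random


class ToyEnv:
    """Hypothetical 1-D stand-in for a continuous control task."""

    def __init__(self, goal=5.0):
        self.goal = goal
        self.pos = 0.0

    def reset(self):
        self.pos = 0.0
        return self.pos

    def step(self, action):
        # Clip the action to [-1, 1] and move the point.
        self.pos += max(-1.0, min(1.0, action))
        # Extrinsic reward: negative distance to the goal.
        reward = -abs(self.goal - self.pos)
        return self.pos, reward


class RandomAgent:
    """Hypothetical agent that only logs what it was shown."""

    def __init__(self, seed=0):
        self.rng = random.Random(seed)
        self.visited = []       # states seen in both phases
        self.rewards_seen = []  # extrinsic rewards the agent was given

    def act(self, obs):
        return self.rng.uniform(-1.0, 1.0)

    def update(self, obs, action, next_obs, reward):
        self.visited.append(next_obs)
        if reward is not None:
            self.rewards_seen.append(reward)


def run_phase(agent, env, num_steps, use_extrinsic_reward):
    """Run one phase; the reward is hidden when use_extrinsic_reward is False."""
    obs = env.reset()
    for _ in range(num_steps):
        action = agent.act(obs)
        next_obs, reward = env.step(action)
        agent.update(obs, action, next_obs,
                     reward if use_extrinsic_reward else None)
        obs = next_obs


def pretrain_then_finetune(agent, env, pretrain_steps, finetune_steps):
    # Phase 1: reward-free pre-training.
    run_phase(agent, env, pretrain_steps, use_extrinsic_reward=False)
    rewards_after_pretraining = len(agent.rewards_seen)
    # Phase 2: downstream adaptation with extrinsic rewards.
    run_phase(agent, env, finetune_steps, use_extrinsic_reward=True)
    return rewards_after_pretraining, len(agent.rewards_seen)
```

For example, `pretrain_then_finetune(RandomAgent(), ToyEnv(), 100, 20)` runs 100 reward-free steps followed by 20 steps with rewards. A real unsupervised method would replace the logging in `update` with learning from an intrinsic signal during the first phase.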

Created:

2022-10-12

Aggregated summary

Background overview

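URLB (Unsupervised Reinforcement Learning Benchmark) is a benchmark dataset for unsupervised reinforcement learning, consisting of two phases: reward-free pre-training and downstream task adaptation with extrinsic rewards. Built on the DeepMind Control Suite, it provides 12 continuous control tasks from three domains for evaluating reinforcement learning algorithms.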

The above content was collected and summarized by 遇见数据集.