On the Theory of Reinforcement Learning

DataCite Commons2024-12-16 更新2025-04-16 收录

下载链接：

https://service.tib.eu/ldmservice/dataset/b4353950-a98d-4ed3-8825-1a7acd676867

下载链接

链接失效反馈

官方服务：

资源简介：

The dataset is used to study a theory of reinforcement learning (RL) in which the learner receives binary feedback only once at the end of an episode.

提供机构：

TIB

创建时间：

2024-12-16

5,000+

优质数据集

54 个

任务类型

进入经典数据集