five

Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Optimization

收藏
DataCite Commons2022-12-23 更新2024-07-13 收录
下载链接:
https://dataverse.lib.nycu.edu.tw/citation?persistentId=doi:10.57770/QHNWZ7
下载链接
链接失效反馈
官方服务:
资源简介:
This repo contains code accompanying the paper, Escaping from Zero Gradient: Revisiting Action-Constrained ReinforcementLearning via Frank-Wolfe Optimization (UAI 2021). It includes code for running the NFWPO algorithm presented in the paper, and other baseline methods such as DDPG+OptLayer, DDPG+Projection, DDPG+Reward Shaping, SAC+Projection, PPO+Projection, TRPO+Projection, FOCOPS.
提供机构:
NYCU Dataverse
创建时间:
2022-06-15
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作