Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Optimization
Source: DataCite Commons | Updated: 2022-12-23 | Indexed: 2024-07-13
Download link:
https://dataverse.lib.nycu.edu.tw/citation?persistentId=doi:10.57770/QHNWZ7
Description:
This repo contains the code accompanying the paper "Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Optimization" (UAI 2021). It includes code for running the NFWPO algorithm presented in the paper, as well as other baseline methods such as DDPG+OptLayer, DDPG+Projection, DDPG+Reward Shaping, SAC+Projection, PPO+Projection, TRPO+Projection, and FOCOPS.
Provider:
NYCU Dataverse
Created:
2022-06-15



