Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Optimization
收藏DataCite Commons2022-05-31 更新2025-04-16 收录
下载链接:
https://dataverse.lib.nycu.edu.tw/citation?persistentId=doi:10.57770/2VTTKE
下载链接
链接失效反馈官方服务:
资源简介:
This repo contains code accompanying the paper, Escaping from Zero Gradient: Revisiting Action-Constrained ReinforcementLearning via Frank-Wolfe Optimization (UAI 2021). It includes code for running the NFWPO algorithm presented in the paper, and other baseline methods such as DDPG+OptLayer, DDPG+Projection, DDPG+Reward Shaping, SAC+Projection, PPO+Projection, TRPO+Projection, FOCOPS.
提供机构:
NYCU Dataverse
创建时间:
2022-05-31



