FLIPPO_LO*TZ
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://data.mendeley.com/datasets/2xnhcyn7g6
下载链接
链接失效反馈官方服务:
资源简介:
Resultant data reported in Feudal Independent Leader Proximal Policy Optimization by Austin Starken and Sean Mondesire. The paper answers the research question, "To what extent does a feudal hierarchy enhance independent PPO agents’ performance, scalability, and generalizability in a high-dimensional environment with sparse and delayed rewards compared to non-hierarchical methods?" The data files contain performance data for the approaches studied in the paper. Performance data was collected at 5 million training steps, 9 million training steps, and a final collection when training was complete. The total required training steps for each approach was collected in COMBINED_TS.xlsx
创建时间:
2024-12-18



