Calculating the reward-rate manifold in the Gaussian-MSE case from Resource-rational reinforcement learning and sensorimotor causal states, and resource-rational maximiners
收藏DataCite Commons2025-10-27 更新2026-04-25 收录
下载链接:
https://rs.figshare.com/articles/dataset/Calculating_the_reward-rate_manifold_in_the_Gaussian-MSE_case_from_Resource-rational_reinforcement_learning_and_sensorimotor_causal_states_and_resource-rational_maximiners/30454062
下载链接
链接失效反馈官方服务:
资源简介:
Calculates a reward-rate manifold when statistics are Gaussian and the reward function is a squared error. In this case, the generalized Blahut-Arimoto algorithm massively simplifies, and we can use the equations in Appendix D to find Fig 1.
当统计分布服从高斯分布且奖励函数为均方误差(squared error)时,可求解得到奖励率流形(reward-rate manifold)。在此场景下,广义布拉胡特-有马算法(generalized Blahut-Arimoto algorithm)的形式会大幅简化,我们可借助附录D中的公式推导得到图1。
提供机构:
The Royal Society
创建时间:
2025-10-27



