Calculating the reward-rate manifold in the Gaussian-MSE case from Resource-rational reinforcement learning and sensorimotor causal states, and resource-rational maximiners

Name: Calculating the reward-rate manifold in the Gaussian-MSE case from Resource-rational reinforcement learning and sensorimotor causal states, and resource-rational maximiners
Creator: The Royal Society
Published: 2025-10-27 10:05:07
License: 暂无描述

DataCite Commons2025-10-27 更新2026-04-25 收录

下载链接：

https://rs.figshare.com/articles/dataset/Calculating_the_reward-rate_manifold_in_the_Gaussian-MSE_case_from_Resource-rational_reinforcement_learning_and_sensorimotor_causal_states_and_resource-rational_maximiners/30454062

下载链接

链接失效反馈

官方服务：

资源简介：

Calculates a reward-rate manifold when statistics are Gaussian and the reward function is a squared error. In this case, the generalized Blahut-Arimoto algorithm massively simplifies, and we can use the equations in Appendix D to find Fig 1.

当统计分布服从高斯分布且奖励函数为均方误差（squared error）时，可求解得到奖励率流形（reward-rate manifold）。在此场景下，广义布拉胡特-有马算法（generalized Blahut-Arimoto algorithm）的形式会大幅简化，我们可借助附录D中的公式推导得到图1。

提供机构：

The Royal Society

创建时间：

2025-10-27