Implicit Value Updating Explains Transitive Inference Performance: The Betasort Model

Figshare2016-01-15 更新2026-04-29 收录

下载链接：

https://figshare.com/articles/dataset/_Implicit_Value_Updating_Explains_Transitive_Inference_Performance_The_Betasort_Model_/1557711

下载链接

链接失效反馈

官方服务：

资源简介：

Transitive inference (the ability to infer that B > D given that B > C and C > D) is a widespread characteristic of serial learning, observed in dozens of species. Despite these robust behavioral effects, reinforcement learning models reliant on reward prediction error or associative strength routinely fail to perform these inferences. We propose an algorithm called betasort, inspired by cognitive processes, which performs transitive inference at low computational cost. This is accomplished by (1) representing stimulus positions along a unit span using beta distributions, (2) treating positive and negative feedback asymmetrically, and (3) updating the position of every stimulus during every trial, whether that stimulus was visible or not. Performance was compared for rhesus macaques, humans, and the betasort algorithm, as well as Q-learning, an established reward-prediction error (RPE) model. Of these, only Q-learning failed to respond above chance during critical test trials. Betasort’s success (when compared to RPE models) and its computational efficiency (when compared to full Markov decision process implementations) suggests that the study of reinforcement learning in organisms will be best served by a feature-driven approach to comparing formal models.

创建时间：

2016-01-15

5,000+

优质数据集

54 个

任务类型

进入经典数据集