Dopamine Bonuses

Sham Kakade; Peter Dayan

NeurIPS 2000

Conference Paper · Artificial Intelligence · Machine Learning

Abstract

Substantial data support a temporal difference (TD) model of dopamine (DA) neuron activity in which the cells provide a global error signal for reinforcement learning. However, in certain circumstances, DA activity seems anomalous under the TD model, responding to non-rewarding stimuli. We address these anomalies by suggesting that DA cells multiplex information about reward bonuses, including Sutton's exploration bonuses and Ng et al.'s non-distorting shaping bonuses. We interpret this additional role for DA in terms of the unconditional attentional and psychomotor effects of dopamine, having the computational role of guiding exploration.
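As a rough illustration of the idea in the abstract, the sketch below adds a potential-based shaping term of the form gamma * phi(s') - phi(s) to a standard TD(0) prediction error. This form of shaping term is the one usually associated with non-distorting shaping bonuses. The function names, the potential values, and the discount factor are assumptions made for this example and are not taken from the paper, which may define its bonuses differently.

```python
# Minimal sketch, not the authors' model: a TD(0) error with an added
# potential-based shaping bonus. All names and numbers are hypothetical.

def td_error(reward, v_s, v_next, gamma=0.9):
    """Standard TD(0) prediction error: r + gamma * V(s') - V(s)."""
    return reward + gamma * v_next - v_s


def shaping_bonus(phi_s, phi_next, gamma=0.9):
    """Potential-based shaping term: gamma * phi(s') - phi(s)."""
    return gamma * phi_next - phi_s


def td_error_with_bonus(reward, v_s, v_next, phi_s, phi_next, gamma=0.9):
    """TD error signal that also carries the shaping bonus."""
    return td_error(reward, v_s, v_next, gamma) + shaping_bonus(phi_s, phi_next, gamma)


def discounted_bonus_sum(phis, gamma=0.9):
    """Discounted sum of shaping terms along a path of potentials.

    The terms telescope to gamma**T * phi(s_T) - phi(s_0), which is why a
    bonus of this form does not change which policy is optimal.
    """
    total = 0.0
    for t in range(len(phis) - 1):
        total += (gamma ** t) * shaping_bonus(phis[t], phis[t + 1], gamma)
    return total


if __name__ == "__main__":
    # A salient but unrewarded stimulus: zero reward and zero value
    # estimates, yet the error signal is positive because of the bonus.
    print(td_error_with_bonus(0.0, 0.0, 0.0, phi_s=0.0, phi_next=1.0))
    # Entering and then leaving the high-potential state nets out.
    print(discounted_bonus_sum([0.0, 1.0, 0.0]))
```

In this toy setting the signal responds to a non-rewarding stimulus, while the discounted bonuses cancel over the whole path, so the bonus affects the transient signal without altering long-run returns.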

Context

Venue: Annual Conference on Neural Information Processing Systems
Paper id: 381478206221631912