Human subjects exploit a cognitive map for credit assignment

Moran, R; Dayan, P; Dolan, RJ

doi:10.1073/pnas.2016884118

Moran, R., Dayan, P., & Dolan, R. (2021). Human subjects exploit a cognitive map for credit assignment. Proceedings of the National Academy of Sciences of the United States of America, 118(4), 1-12. doi:10.1073/pnas.2016884118.

Item is Released

Basic

Item Permalink: https://hdl.handle.net/21.11116/0000-0007-D310-8
Version Permalink: https://hdl.handle.net/21.11116/0000-0007-D311-7

Genre: Journal Article

Locators

Locator:
https://www.pnas.org/content/pnas/118/4/e2016884118.full.pdf (Publisher version) Open Access status unknown

Description:
-

OA-Status:

Creators

Creators:
Moran, R, Author
Dayan, P (1, 2), Author
Dolan, RJ, Author

Affiliations:
(1) Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_3017468
(2) Max Planck Institute for Biological Cybernetics, Max Planck Society, Spemannstrasse 38, 72076 Tübingen, DE, ou_1497794

Content

Free keywords: -

Abstract: An influential reinforcement learning framework proposes that behavior is jointly governed by model-free (MF) and model-based (MB) controllers. The former learns the values of actions directly from past encounters, and the latter exploits a cognitive map of the task to calculate these prospectively. Considerable attention has been paid to how these systems interact during choice, but how and whether knowledge of a cognitive map contributes to the way MF and MB controllers assign credit (i.e., to how they revaluate actions and states following the receipt of an outcome) remains underexplored. Here, we examine such sophisticated credit assignment using a dual-outcome bandit task. We provide evidence that knowledge of a cognitive map influences credit assignment in both MF and MB systems, mediating subtly different aspects of apparent relevance. Specifically, we show MF credit assignment is enhanced for those rewards that are related to a choice, and this contrasted with choice-unrelated rewards that reinforced subsequent choices negatively. This modulation is only possible based on knowledge of task structure. On the other hand, MB credit assignment was boosted for outcomes that impacted on differences in values between offered bandits. We consider mechanistic accounts and the normative status of these findings. We suggest the findings extend the scope and sophistication of cognitive map-based credit assignment during reinforcement learning, with implications for understanding behavioral control.

Details

Language(s):

Dates: Date issued: 2021-01

Publication Status: Issued

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: DOI: 10.1073/pnas.2016884118
eDoc: e2016884118

Degree: -

Source 1

Title: Proceedings of the National Academy of Sciences of the United States of America

Other : PNAS

Other : Proceedings of the National Academy of Sciences of the USA

Abbreviation : Proc. Natl. Acad. Sci. U. S. A.

Source Genre: Journal

Creator(s):

Affiliations:

Publ. Info: Washington, D.C. : National Academy of Sciences

Pages: -
Volume / Issue: 118 (4)
Sequence Number: -
Start / End Page: 1 - 12
Identifier: ISSN: 0027-8424
CoNE: https://pure.mpg.de/cone/journals/resource/954925427230