Combined model-free and model-sensitive reinforcement learning in non-human 
primates

Miranda, B; Malalasekera, WMN; Behrens, TE; Dayan, P; Kennerley, SW

doi:10.1371/journal.pcbi.1007944

Lokale TagsFreigabegeschichteDetailsÜbersicht

Combined model-free and model-sensitive reinforcement learning in non-human primates

Miranda, B., Malalasekera, W., Behrens, T., Dayan, P., & Kennerley, S. (2020). Combined model-free and model-sensitive reinforcement learning in non-human primates. PLoS Computational Biology, 16(6), 1-25. doi:10.1371/journal.pcbi.1007944.

Item is Freigegeben

einblenden: alle ausblenden: alle

Basisdaten

einblenden: ausblenden:

Datensatz-Permalink: https://hdl.handle.net/21.11116/0000-0006-B9F5-5 Versions-Permalink: https://hdl.handle.net/21.11116/0000-0006-B9F6-4

Genre: Zeitschriftenartikel

Dateien

einblenden: Dateien

Externe Referenzen

einblenden:

ausblenden:

externe Referenz:
https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1007944&type=printable (Verlagsversion) Open Access Status unbekannt

Beschreibung:
-

OA-Status:

Urheber

einblenden:

ausblenden:

Urheber:
Miranda, B, Autor
Malalasekera, WMN, Autor
Behrens, TE, Autor
Dayan, P¹, Autor
Kennerley, SW, Autor

Affiliations:
1External Organizations, ou_persistent22

Inhalt

einblenden:

ausblenden:

Schlagwörter: -

Zusammenfassung: Contemporary reinforcement learning (RL) theory suggests that potential choices can be evaluated by strategies that may or may not be sensitive to the computational structure of tasks. A paradigmatic model-free (MF) strategy simply repeats actions that have been rewarded in the past; by contrast, model-sensitive (MS) strategies exploit richer information associated with knowledge of task dynamics. MF and MS strategies should typically be combined, because they have complementary statistical and computational strengths; however, this tradeoff between MF/MS RL has mostly only been demonstrated in humans, often with only modest numbers of trials. We trained rhesus monkeys to perform a two-stage decision task designed to elicit and discriminate the use of MF and MS methods. A descriptive analysis of choice behaviour revealed directly that the structure of the task (of MS importance) and the reward history (of MF and MS importance) significantly influenced both choice and response vigour. A detailed, trial-by-trial computational analysis confirmed that choices were made according to a combination of strategies, with a dominant influence of a particular form of model sensitivity that persisted over weeks of testing. The residuals from this model necessitated development of a new combined RL model which incorporates a particular credit assignment weighting procedure. Finally, response vigor exhibited a subtly different collection of MF and MS influences. These results provide new illumination onto RL behavioural processes in non-human primates.

Details

einblenden:

ausblenden:

Sprache(n):

Datum: Online veröffentlicht: 2020-06

Publikationsstatus: Online veröffentlicht

Seiten: -

Ort, Verlag, Ausgabe: -

Inhaltsverzeichnis: -

Art der Begutachtung: -

Identifikatoren: DOI: 10.1371/journal.pcbi.1007944
eDoc: e1007944

Art des Abschluß: -

ausblenden:

Titel: PLoS Computational Biology

Genre der Quelle: Zeitschrift

Urheber:

Affiliations:

Ort, Verlag, Ausgabe: San Francisco, CA : Public Library of Science

Seiten: - Band / Heft: 16 (6) Artikelnummer: - Start- / Endseite: 1 - 25 Identifikator: ISSN: 1553-734X
CoNE: https://pure.mpg.de/cone/journals/resource/1000000000017180_1

Datensatz

Basisdaten

Dateien

Externe Referenzen

Urheber

Inhalt

Details

Veranstaltung

Entscheidung

Projektinformation

Quelle 1