Contingency and Correlation in Reversal Learning

Pietras, B; Dayan, P; Stalnaker, T; Schoenbaum, G; Yu, T-L

Pietras, B., Dayan, P., Stalnaker, T., Schoenbaum, G., & Yu, T.-L. (2015). Contingency and Correlation in Reversal Learning. Poster presented at 2nd Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2015), Edmonton, AB, Canada.

Item is released

Basic data

Record permalink: https://hdl.handle.net/21.11116/0000-0008-8444-6 Version permalink: https://hdl.handle.net/21.11116/0000-0008-8445-5

Genre: Poster

Files

External references

External reference:
https://rldm.org/wp-content/uploads/2015/06/RLDM15AbstractsBooklet.pdf (Abstract) Open access status unknown

Description:
-

OA status:

Creators

Creators:
Pietras, B, Author
Dayan, P¹, Author
Stalnaker, T, Author
Schoenbaum, G, Author
Yu, T-L, Author

Affiliations:
¹ External Organizations, ou_persistent22

Content

Keywords: -

Abstract: Reversal learning is one of the most venerable paradigms for studying the acquisition, extinction, and reacquisition of knowledge in humans and other animals. It has been of particular value in asking questions about the roles played by prefrontal structures such as the orbitofrontal cortex (OFC). Indeed, evidence from rats and monkeys suggests that these areas are involved in various forms of context-sensitive inference about the contingencies linking cues and actions over time to the value and identity of predicted outcomes. In order to explore these roles in depth, we fit data from a substantial behavioural neuroscience study in rodents who experienced blocks of free- and forced-choice instrumental learning trials with identity or value reversals at each block transition. We constructed two classes of models, fit their parameters using a random effects treatment, tested their generative competence, and selected between them based on a complexity-sensitive integrated Bayesian Information Criteria score. One class of ‘return’-based models was based on elaborations of a standard Q-learning algorithm, including parameters such as different learning rates or combination rules for forced- and fixed-choice trials, behavioural lapses, and eligibility traces. The other novel class of ‘income’-based models exploited the weak notion of contingency over time advocated by Walton et al. (2010) in their analysis of the choices of monkeys with OFC lesions. We show that income-based and return-based models are both able to predict the behaviour well, and examine their performance and implications for reinforcement learning. The outcome of this study sets the stage for the next phase of the research that will attempt to correlate the values of the parameters to neural recordings taken in the rats while performing the task.
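Illustrative note (not part of the original record): the ‘return’-based model class named in the abstract can be pictured with a minimal Python sketch. Everything here is an assumption made for illustration: the function names, the softmax choice rule, the uniform lapse mixture, and the use of one learning rate per trial type are hypothetical, and the plain BIC shown is a simplification of the integrated, random-effects score the abstract mentions. Eligibility traces and the ‘income’-based models are not covered.

```python
import math


def softmax_with_lapse(q_values, beta, lapse):
    """Choice probabilities: softmax over Q-values mixed with uniform lapses.

    beta is an assumed inverse temperature; lapse is the assumed probability
    of choosing uniformly at random, regardless of the Q-values.
    """
    m = max(q_values)  # subtract the maximum for numerical stability
    exps = [math.exp(beta * (q - m)) for q in q_values]
    total = sum(exps)
    n = len(q_values)
    return [(1.0 - lapse) * e / total + lapse / n for e in exps]


def q_update(q_values, action, reward, forced, alpha_free, alpha_forced):
    """Return a copy of q_values after a delta-rule update of the chosen action.

    The learning rate depends on whether the trial was forced-choice,
    one hypothetical reading of "different learning rates" per trial type.
    """
    alpha = alpha_forced if forced else alpha_free
    updated = list(q_values)
    updated[action] += alpha * (reward - updated[action])
    return updated


def bic(log_likelihood, n_params, n_observations):
    """Plain Bayesian Information Criterion (lower is better).

    This is the textbook form, not the integrated variant used in the study.
    """
    return n_params * math.log(n_observations) - 2.0 * log_likelihood
```

In a sketch like this, a model would be fit by summing the log of `softmax_with_lapse(...)[chosen_action]` over free-choice trials and passing that total to `bic` together with the number of free parameters.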

Details

Language(s):

Date: Published online: 2015-06

Publication status: Published online

Pages: -

Place, publisher, edition: -

Table of contents: -

Type of review: -

Identifiers: -

Degree type: -

Event

Title: 2nd Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2015)

Venue: Edmonton, AB, Canada

Start/end date: 2015-06-07 - 2015-06-10

Source 1

Title: Reinforcement Learning and Decision Making 2015

Source genre: Conference proceedings

Creators:

Affiliations:

Place, publisher, edition: -

Pages: - Volume / Issue: - Article number: M51 Start / end page: 33 Identifier: -
