  Zero-shot compositional reinforcement learning in humans

Jagadish, A., Binz, M., Saanum, T., Wang, J., & Schulz, E. (submitted). Zero-shot compositional reinforcement learning in humans.

Locators

Locator:
https://psyarxiv.com/ymve5/download (Any fulltext)
Description:
-
OA-Status:
Not specified

Creators

 Creators:
Jagadish, AK (1), Author
Binz, M (1), Author
Saanum, T (1), Author
Wang, JX, Author
Schulz, E (1), Author
Affiliations:
(1) Research Group Computational Principles of Intelligence, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_3189356

Content

Free keywords: -
 Abstract: People can easily evoke previously learned concepts, compose them, and apply the result to solve novel tasks on the first attempt. The aim of this paper is to improve our understanding of how people make such zero-shot compositional inferences in a reinforcement learning setting. To achieve this, we introduce an experimental paradigm where people learn two latent reward functions and need to compose them correctly to solve a novel task. We find that people have the capability to engage in zero-shot compositional reinforcement learning but deviate systematically from optimality. However, their mistakes are structured and can be explained by their performance in the sub-tasks leading up to the composition. Through extensive model-based analyses, we find that a meta-learned neural network model that accounts for limited computational resources best captures participants’ behaviour. Moreover, the amount of computational resources identified by this model reliably quantifies how good individual participants are at zero-shot compositional reinforcement learning. Taken together, our work takes a considerable step towards studying compositional reasoning in agents – both natural and artificial – with limited computational resources.

Details

Language(s):
 Dates: 2023-07
 Publication Status: Submitted
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: DOI: 10.31234/osf.io/ymve5
 Degree: -
