English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  An inductive bias for slowly changing features in human reinforcement learning

Hedrich, N. L., Schulz, E., Hall-McMaster, S., & Schuck, N. W. (2024). An inductive bias for slowly changing features in human reinforcement learning. BioRxiv, January 24, 2024.

Item is

Files

show Files
hide Files
:
2024.01.24.576910v1.full.pdf (Preprint), 3MB
 
File Permalink:
-
Name:
2024.01.24.576910v1.full.pdf
Description:
-
OA-Status:
Visibility:
Restricted (Max Planck Institute for Human Development, MBBF; )
MIME-Type / Checksum:
application/pdf
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-
:
media-1-5.pdf (Supplementary material), 4MB
Name:
media-1-5.pdf
Description:
-
OA-Status:
Not specified
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-

Locators

show
hide
Description:
-
OA-Status:
Green

Creators

show
hide
 Creators:
Hedrich, Noa L.1, Author           
Schulz, Eric, Author
Hall-McMaster, Sam1, 2, Author                 
Schuck, Nicolas W.1, 2, Author                 
Affiliations:
1Max Planck Research Group NeuroCode - Neural and Computational Basis of Learning, Memory and Decision Making, Max Planck Institute for Human Development, Max Planck Society, ou_2489696              
2Max Planck UCL Centre for Computational Psychiatry and Ageing Research, Berlin, Germany, and London, UK, Max Planck Institute for Human Development, Max Planck Society, Lentzeallee 94, D-14195 Berlin, DE; Russell Square House, 10-12 Russell Square, London, WC1B 5EH, UK, ou_2205641              

Content

show
hide
Free keywords: -
 Abstract: Identifying goal-relevant features in novel environments is a central challenge for efficient behaviour. We asked whether humans address this challenge by relying on prior knowledge about common properties of reward-predicting features. One such property is the rate of change of features, given that behaviourally relevant processes tend to change on a slower timescale than noise. Hence, we asked whether humans are biased to learn more when task-relevant features are slow rather than fast. To test this idea, 100 human participants were asked to learn the rewards of two-dimensional bandits when either a slowly or quickly changing feature of the bandit predicted reward. Participants accrued more reward and achieved better generalisation to unseen feature values when a bandit's relevant feature changed slowly, and its irrelevant feature quickly, as compared to the opposite. Participants were also more likely to incorrectly base their choices on the irrelevant feature when it changed slowly versus quickly. These effects were stronger when participants experienced the feature speed before learning about rewards. Modelling this behaviour with a set of four function approximation Kalman filter models that embodied alternative hypotheses about how feature speed could affect learning revealed that participants had a higher learning rate for the slow feature, and adjusted their learning to both the relevance and the speed of feature changes. The larger the improvement in participants' performance for slow compared to fast bandits, the more strongly they adjusted their learning rates. These results provide evidence that human reinforcement learning favours slower features, suggesting a bias in how humans approach reward learning.

Details

show
hide
Language(s): eng - English
 Dates: 2024-01-24
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: No review
 Identifiers: DOI: 10.1101/2024.01.24.576910
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: BioRxiv
Source Genre: Web Page
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: - Sequence Number: January 24, 2024 Start / End Page: - Identifier: -