Abstract:
Acquiring goal-directed behaviors requires us to learn which features of our environment are task-relevant and which can be ignored. Machine learning research has suggested that meaningful information in the input data is often represented by features that change slowly over time, while fast variations may represent noise. Focusing on slowly changing features of the environment during learning could therefore be a useful bias for humans when selecting task-relevant features, even when the underlying task structure is unknown. To test this idea, we investigated whether humans are better at learning the reward predictiveness of slowly vs. quickly changing features of two-dimensional bandits. We found that subjects accrued more reward during learning and achieved higher accuracy on subsequent test trials when a bandit's relevant feature changed slowly and its irrelevant feature quickly, as compared to the opposite. Model fitting with a set of function approximation models that had either a single fixed learning rate or feature-speed-dependent learning rates showed that participants with a stronger effect adapted their learning rates to the features' temporal coherence. These results provide evidence that human reinforcement learning is sensitive to the timescales over which features change, akin to the ‘temporal coherence prior’ in the machine learning literature.
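
For illustration only, the following is a minimal sketch (not the authors' actual model) of the kind of function approximation learner the abstract alludes to: a linear, delta-rule learner for a two-feature bandit in which the learning rate applied to each feature depends on how quickly that feature changes. The feature dynamics, reward structure, and the learning-rate values `alpha_slow` and `alpha_fast` are assumptions made for this sketch.

```python
import numpy as np

# Sketch under stated assumptions: a linear function-approximation learner
# for a two-feature bandit with feature-speed-dependent learning rates.
rng = np.random.default_rng(0)

n_trials = 200
alpha_slow, alpha_fast = 0.3, 0.05   # hypothetical speed-dependent learning rates
w = np.zeros(2)                      # weights for [slow feature, fast feature]

slow_val = rng.uniform(-1, 1)
for t in range(n_trials):
    # Slow feature drifts gradually; fast feature is resampled every trial.
    slow_val += 0.05 * rng.normal()
    fast_val = rng.uniform(-1, 1)
    x = np.array([slow_val, fast_val])

    # In this toy example, only the slow feature predicts reward.
    reward = slow_val + 0.1 * rng.normal()

    # Delta-rule update with a separate learning rate per feature.
    prediction_error = reward - w @ x
    alphas = np.array([alpha_slow, alpha_fast])
    w += alphas * prediction_error * x

print("learned weights:", w)
```

In this sketch, assigning a larger learning rate to the slowly changing feature lets the learner pick up its reward predictiveness faster, which is one way to instantiate the speed-dependent learning the abstract describes.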