The pursuit of happiness: A reinforcement learning perspective on habituation and comparisons

Dubey, R; Griffiths, TL; Dayan, P

doi:10.31234/osf.io/8jd2x

Dubey, R., Griffiths, T., & Dayan, P. (2022). The pursuit of happiness: A reinforcement learning perspective on habituation and comparisons. PLoS Computational Biology, 18(8): e1010316. doi:10.31234/osf.io/8jd2x.

Item is Released

Basic

Item Permalink: https://hdl.handle.net/21.11116/0000-000A-6E1D-C
Version Permalink: https://hdl.handle.net/21.11116/0000-000A-D54F-E

Genre: Journal Article

Locators

Locator: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1010316&type=printable (Publisher version)

Description: -

OA-Status: Open Access status unknown

Creators

Creators:
Dubey, R, Author
Griffiths, TL, Author
Dayan, P¹, Author

Affiliations:
¹ Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_3017468

Content

Free keywords: -

Abstract: In evaluating our choices, we often suffer from two tragic relativities. First, when our lives change for the better, we rapidly habituate to the higher standard of living. Second, we cannot escape comparing ourselves to various relative standards. Habituation and comparisons can be very disruptive to decision-making and happiness, and to date, it remains a puzzle why they have come to be a part of cognition in the first place. Here, we present computational evidence that suggests that these features might play an important role in promoting adaptive behavior. Using the framework of reinforcement learning, we explore the benefit of employing a reward function that, in addition to the reward provided by the underlying task, also depends on prior expectations and relative comparisons. We find that while agents equipped with this reward function are less happy, they learn faster and significantly outperform standard reward-based agents in a wide range of environments. Specifically, we find that relative comparisons speed up learning by providing an exploration incentive to the agents, and prior expectations serve as a useful aid to comparisons, especially in sparsely rewarded and non-stationary environments. Our simulations also reveal potential drawbacks of this reward function and show that agents perform sub-optimally when comparisons are left unchecked and when there are too many similar options. Together, our results help explain why we are prone to becoming trapped in a cycle of never-ending wants and desires, and may shed light on psychopathologies such as depression, materialism, and overconsumption.

Details

Language(s):

Dates: Published Online: 2022-08

Publication Status: Published online

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: DOI: 10.31234/osf.io/8jd2x

Degree: -

Source 1

Title: PLoS Computational Biology

Source Genre: Journal

Creator(s):

Affiliations:

Publ. Info: San Francisco, CA : Public Library of Science

Pages: 32

Volume / Issue: 18 (8)

Sequence Number: e1010316

Start / End Page: -

Identifier: ISSN: 1553-734X
CoNE: https://pure.mpg.de/cone/journals/resource/1000000000017180_1