The pursuit of happiness: A reinforcement learning perspective on habituation and comparisons

Dubey, R; Griffiths, T; Dayan, P



Dubey, R., Griffiths, T., & Dayan, P. (2022). The pursuit of happiness: A reinforcement learning perspective on habituation and comparisons. Talk presented at Annual Meeting of the Society for NeuroEconomics (SNE 2022). Arlington, VA, USA. 2022-09-30 - 2022-10-02.

Item is released

Basic data

Record permalink: https://hdl.handle.net/21.11116/0000-000B-0CF9-0
Version permalink: https://hdl.handle.net/21.11116/0000-000B-0D01-6

Genre: Talk

Files

External references

External reference:
https://neuroeconomics.org/wp-content/uploads/2022/09/SNE2022_Abstract-Proceedings.pdf (Abstract) Open access status unknown

Description:
-

OA status:
Not specified

Creators

Creators:
Dubey, R, Author
Griffiths, T, Author
Dayan, P¹, Author

Affiliations:
¹ Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_3017468

Content

Keywords: -

Abstract:

Objective: In evaluating our choices, we often suffer from two tragic relativities. First, when our lives change for the better, we rapidly habituate to the higher standard of living. Second, we cannot escape comparing ourselves to various relative standards. Habituation and comparisons can be very disruptive to our happiness and decision-making, and to date, it remains a puzzle why they have come to be a part of cognition in the first place. This study's objective is to provide a precise characterization of how and why these relative aspects might be desirable features of intelligent agents.

Methods: Here, we adopt the computational framework of reinforcement learning (RL). In standard RL theory, the reward function serves the role of defining optimal behavior, i.e., what the agent ought to accomplish. However, recent work on reward design has embraced the observation that the reward function plays a second, critical role in RL: steering the agent from incompetence to mastery. These steering reward functions, often provided by the designer to the agent, have subjective features detached from the particular task but can nevertheless guide the learning of the agent. Here, we use this idea and endow agents with a subjective reward function that, in addition to the reward provided by the underlying task, also depends on prior expectations and relative comparisons. We then embed these agents in various parameterized environments and compare their performance against standard RL agents whose reward function depends on just the task reward value.

Results: Extensive simulations reveal that agents equipped with this reward function learn and explore very efficiently in a wide range of settings. Notably, they significantly outperform standard reward-based agents in sparsely rewarded environments, t(198) = 35.6, p < 0.01, and in non-stationary environments, t(198) = 30.1, p < 0.01. Our simulations also reveal potential drawbacks of this reward function and show that agents perform sub-optimally when comparisons are left unchecked and when there are too many similar options.

Conclusions: Our results suggest that a subjective reward function based on prior expectations and comparisons might play an important role in promoting adaptive behavior by serving as a powerful learning signal. This provides computational support for a longstanding assumption in the field and explains why the human reward function might be based on these features. Together, our results help explain why we are prone to becoming trapped in a cycle of never-ending wants and desires, and may shed light on psychopathologies such as depression, materialism, and overconsumption.
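The abstract does not give the functional form of the subjective reward, so the following Python sketch is only an illustration of the idea described under Methods: a reward that combines the task reward with a term relative to a prior expectation and a term relative to a comparison standard. The class name `SubjectiveReward`, the weights, the linear combination, and the running-average update of the expectation are all assumptions made for this example and are not taken from the talk.

```python
from dataclasses import dataclass


@dataclass
class SubjectiveReward:
    """Hypothetical subjective reward; the form and all parameters are assumptions."""

    w_task: float = 1.0            # weight on the raw task reward
    w_expect: float = 0.5          # weight on reward relative to prior expectation
    w_compare: float = 0.5         # weight on reward relative to a comparison standard
    habituation_rate: float = 0.1  # how fast the expectation tracks recent rewards
    expectation: float = 0.0       # running expectation of the task reward

    def __call__(self, task_reward: float, reference: float) -> float:
        # Combine the task reward with the two relative terms.
        value = (
            self.w_task * task_reward
            + self.w_expect * (task_reward - self.expectation)
            + self.w_compare * (task_reward - reference)
        )
        # Habituation: move the expectation toward the reward just received.
        self.expectation += self.habituation_rate * (task_reward - self.expectation)
        return value


if __name__ == "__main__":
    reward_fn = SubjectiveReward(w_expect=1.0, w_compare=1.0, habituation_rate=0.5)
    # The same task reward is received repeatedly against a fixed reference.
    for step in range(3):
        print(step, reward_fn(task_reward=1.0, reference=2.0))
```

Under these assumed settings, a constant task reward yields a shrinking subjective reward as the expectation catches up, which is one simple way to picture habituation; a standard RL agent would correspond to setting `w_expect` and `w_compare` to zero.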

Details

Language(s):

Date: Published online: 2022-10

Publication status: Published online

Pages: -

Place, publisher, edition: -

Table of contents: -

Review type: -

Identifiers: -

Degree type: -

Event

Title: Annual Meeting of the Society for NeuroEconomics (SNE 2022)

Location: Arlington, VA, USA

Start/end date: 2022-09-30 - 2022-10-02

Source 1

Title: Annual Meeting of the Society for NeuroEconomics (SNE 2022)

Source genre: Proceedings

Creators:

Affiliations:

Place, publisher, edition: -

Pages: - Volume / Issue: - Article number: S.01.04 Start / end page: 5 - 6 Identifier: -
