Risking your Tail: Modeling Individual Differences in Risk-sensitive 
Exploration using Conditional Value at Risk and Bayes Adaptive Markov Decision 
Processes

Shen, T; Dayan, P

doi:10.1101/2024.01.07.574574

Lokale TagsFreigabegeschichteDetailsÜbersicht

Risking your Tail: Modeling Individual Differences in Risk-sensitive Exploration using Conditional Value at Risk and Bayes Adaptive Markov Decision Processes

Shen, T., & Dayan, P. (submitted). Risking your Tail: Modeling Individual Differences in Risk-sensitive Exploration using Conditional Value at Risk and Bayes Adaptive Markov Decision Processes.

Item is Freigegeben

einblenden: alle ausblenden: alle

Basisdaten

einblenden: ausblenden:

Datensatz-Permalink: https://hdl.handle.net/21.11116/0000-000E-599E-D Versions-Permalink: https://hdl.handle.net/21.11116/0000-000F-63DB-B

Genre: Preprint

Dateien

einblenden: Dateien

Externe Referenzen

einblenden:

ausblenden:

externe Referenz:
https://www.biorxiv.org/content/10.1101/2024.01.07.574574v2.full.pdf (beliebiger Volltext) Open Access Status unbekannt

Beschreibung:
-

OA-Status:
Keine Angabe

Urheber

einblenden:

ausblenden:

Urheber:
Shen, T¹, Autor
Dayan, P¹, Autor

Affiliations:
1Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_3017468

Inhalt

einblenden:

ausblenden:

Schlagwörter: -

Zusammenfassung: Novelty is a double-edged sword for agents and animals alike: they might benefit from untapped resources or face unexpected costs or dangers such as predation. The conventional exploration/exploitation tradeoff is thus coloured by risk-sensitivity. A wealth of experiments has shown how animals solve this dilemma, for example using intermittent approach. However, there are large individual differences in the nature of approach, and modeling has yet to elucidate how this might be based on animals' differing prior expectations about reward and threat and degrees of risk aversion. To capture these factors, we built a Bayes adaptive Markov decision process model with three key components: an adaptive hazard function capturing potential predation, an intrinsic reward function providing the urge to explore, and a conditional value at risk (CVaR) objective, which is a contemporary measure of trait risk-sensitivity. We fit this model to a coarse-grain abstraction of the behaviour of 26 animals who freely explored a novel object in an open-field arena (Akiti et al. Neuron 110, 2022). We show that the model captures both quantitative (frequency, duration of exploratory bouts) and qualitative (stereotyped tail-behind) features of behavior, including the substantial idiosyncrasies that were observed. We find that “brave” animals, though varied in their behavior, generally are more risk neutral, and enjoy a flexible hazard prior. They begin with cautious exploration, and quickly transition to confident approach to maximize exploration for reward. On the other hand, “timid” animals, characterized by risk aversion and high and inflexible hazard priors, display self-censoring that leads to the sort of asymptotic maladaptive behavior that is often associated with psychiatric illnesses such as anxiety and depression. Explaining risk-sensitive exploration using factorized parameters of reinforcement learning models could aid in the understanding, diagnosis, and treatment of psychiatric abnormalities in humans and other animals.

Details

einblenden:

ausblenden:

Sprache(n):

Datum: Eingereicht: 2024-06

Publikationsstatus: Eingereicht

Seiten: -

Ort, Verlag, Ausgabe: -

Inhaltsverzeichnis: -

Art der Begutachtung: -

Identifikatoren: DOI: 10.1101/2024.01.07.574574

Art des Abschluß: -

Datensatz

Basisdaten

Dateien

Externe Referenzen

Urheber

Inhalt

Details

Veranstaltung

Entscheidung

Projektinformation

Quelle