English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Journal Article

Introspection dynamics: a simple model of counterfactual learning in asymmetric games

MPS-Authors
/persons/resource/persons274056

Couto,  M. C.
IMPRS for Evolutionary Biology, Max Planck Institute for Evolutionary Biology, Max Planck Society;
Max Planck Research Group Dynamics of Social Behavior (Hilbe), Max Planck Institute for Evolutionary Biology, Max Planck Society;

/persons/resource/persons199253

Giaimo,  S.
Department Evolutionary Theory, Max Planck Institute for Evolutionary Biology, Max Planck Society;

/persons/resource/persons61106

Hilbe,  C.
Max Planck Research Group Dynamics of Social Behavior (Hilbe), Max Planck Institute for Evolutionary Biology, Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)

Couto_2022_New_J._Phys._24_063010.pdf
(Publisher version), 4MB

Supplementary Material (public)
There is no public supplementary material available
Citation

Couto, M. C., Giaimo, S., & Hilbe, C. (2022). Introspection dynamics: a simple model of counterfactual learning in asymmetric games. New Journal of Physics, 24: 063010. doi:10.1088/1367-2630/ac6f76.


Cite as: https://hdl.handle.net/21.11116/0000-000A-FBBD-7
Abstract
Social behavior in human and animal populations can be studied as an evolutionary process.Individuals often make decisions between different strategies, and those strategies that yield afitness advantage tend to spread. Traditionally, much work in evolutionary game theory considerssymmetric games: individuals are assumed to have access to the same set of strategies, and theyexperience the same payoff consequences. As a result, they can learn more profitable strategies byimitation. However, interactions are oftentimes asymmetric. In that case, imitation may beinfeasible (because individuals differ in the strategies they are able to use), or it may be undesirable(because individuals differ in their incentives to use a strategy). Here, we consider an alternativelearning process which applies to arbitrary asymmetric games,introspection dynamics. Accordingto this dynamics, individuals regularly compare their present strategy to a randomly chosenalternative strategy. If the alternative strategy yields a payoff advantage, it is more likely adopted. Inthis work, we formalize introspection dynamics for pairwise games. We derive simple and explicitformulas for the abundance of each strategy over time and apply these results to severalwell-known social dilemmas. In particular, for the volunteer’s timing dilemma, we show that theplayer with the lowest cooperation cost learns to cooperate without delay.