Cargando…

Win-Stay-Lose-Shift as a self-confirming equilibrium in the iterated Prisoner’s Dilemma

Evolutionary game theory assumes that players replicate a highly scored player’s strategy through genetic inheritance. However, when learning occurs culturally, it is often difficult to recognize someone’s strategy just by observing the behaviour. In this work, we consider players with memory-one st...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Minjae, Choi, Jung-Kyoo, Baek, Seung Ki
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8242928/
https://www.ncbi.nlm.nih.gov/pubmed/34187189
http://dx.doi.org/10.1098/rspb.2021.1021
Descripción
Sumario:Evolutionary game theory assumes that players replicate a highly scored player’s strategy through genetic inheritance. However, when learning occurs culturally, it is often difficult to recognize someone’s strategy just by observing the behaviour. In this work, we consider players with memory-one stochastic strategies in the iterated Prisoner’s Dilemma, with an assumption that they cannot directly access each other’s strategy but only observe the actual moves for a certain number of rounds. Based on the observation, the observer has to infer the resident strategy in a Bayesian way and chooses his or her own strategy accordingly. By examining the best-response relations, we argue that players can escape from full defection into a cooperative equilibrium supported by Win-Stay-Lose-Shift in a self-confirming manner, provided that the cost of cooperation is low and the observational learning supplies sufficiently large uncertainty.