Cargando…

Using a Stochastic Agent Model to Optimize Performance in Divergent Interest Tacit Coordination Games

In recent years collaborative robots have become major market drivers in industry 5.0, which aims to incorporate them alongside humans in a wide array of settings ranging from welding to rehabilitation. Improving human–machine collaboration entails using computational algorithms that will save proce...

Descripción completa

Detalles Bibliográficos
Autores principales:	Mizrahi, Dor, Zuckerman, Inon, Laufer, Ilan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7763831/ https://www.ncbi.nlm.nih.gov/pubmed/33302476 http://dx.doi.org/10.3390/s20247026

_version_	1783628111723626496
author	Mizrahi, Dor Zuckerman, Inon Laufer, Ilan
author_facet	Mizrahi, Dor Zuckerman, Inon Laufer, Ilan
author_sort	Mizrahi, Dor
collection	PubMed
description	In recent years collaborative robots have become major market drivers in industry 5.0, which aims to incorporate them alongside humans in a wide array of settings ranging from welding to rehabilitation. Improving human–machine collaboration entails using computational algorithms that will save processing as well as communication cost. In this study we have constructed an agent that can choose when to cooperate using an optimal strategy. The agent was designed to operate in the context of divergent interest tacit coordination games in which communication between the players is not possible and the payoff is not symmetric. The agent’s model was based on a behavioral model that can predict the probability of a player converging on prominent solutions with salient features (e.g., focal points) based on the player’s Social Value Orientation (SVO) and the specific game features. The SVO theory pertains to the preferences of decision makers when allocating joint resources between themselves and another player in the context of behavioral game theory. The agent selected stochastically between one of two possible policies, a greedy or a cooperative policy, based on the probability of a player to converge on a focal point. The distribution of the number of points obtained by the autonomous agent incorporating the SVO in the model was better than the results obtained by the human players who played against each other (i.e., the distribution associated with the agent had a higher mean value). Moreover, the distribution of points gained by the agent was better than any of the separate strategies the agent could choose from, namely, always choosing a greedy or a focal point solution. To the best of our knowledge, this is the first attempt to construct an intelligent agent that maximizes its utility by incorporating the belief system of the player in the context of tacit bargaining. This reward-maximizing strategy selection process based on the SVO can also be potentially applied in other human–machine contexts, including multiagent systems.
format	Online Article Text
id	pubmed-7763831
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-77638312020-12-27 Using a Stochastic Agent Model to Optimize Performance in Divergent Interest Tacit Coordination Games Mizrahi, Dor Zuckerman, Inon Laufer, Ilan Sensors (Basel) Article In recent years collaborative robots have become major market drivers in industry 5.0, which aims to incorporate them alongside humans in a wide array of settings ranging from welding to rehabilitation. Improving human–machine collaboration entails using computational algorithms that will save processing as well as communication cost. In this study we have constructed an agent that can choose when to cooperate using an optimal strategy. The agent was designed to operate in the context of divergent interest tacit coordination games in which communication between the players is not possible and the payoff is not symmetric. The agent’s model was based on a behavioral model that can predict the probability of a player converging on prominent solutions with salient features (e.g., focal points) based on the player’s Social Value Orientation (SVO) and the specific game features. The SVO theory pertains to the preferences of decision makers when allocating joint resources between themselves and another player in the context of behavioral game theory. The agent selected stochastically between one of two possible policies, a greedy or a cooperative policy, based on the probability of a player to converge on a focal point. The distribution of the number of points obtained by the autonomous agent incorporating the SVO in the model was better than the results obtained by the human players who played against each other (i.e., the distribution associated with the agent had a higher mean value). Moreover, the distribution of points gained by the agent was better than any of the separate strategies the agent could choose from, namely, always choosing a greedy or a focal point solution. To the best of our knowledge, this is the first attempt to construct an intelligent agent that maximizes its utility by incorporating the belief system of the player in the context of tacit bargaining. This reward-maximizing strategy selection process based on the SVO can also be potentially applied in other human–machine contexts, including multiagent systems. MDPI 2020-12-08 /pmc/articles/PMC7763831/ /pubmed/33302476 http://dx.doi.org/10.3390/s20247026 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Mizrahi, Dor Zuckerman, Inon Laufer, Ilan Using a Stochastic Agent Model to Optimize Performance in Divergent Interest Tacit Coordination Games
title	Using a Stochastic Agent Model to Optimize Performance in Divergent Interest Tacit Coordination Games
title_full	Using a Stochastic Agent Model to Optimize Performance in Divergent Interest Tacit Coordination Games
title_fullStr	Using a Stochastic Agent Model to Optimize Performance in Divergent Interest Tacit Coordination Games
title_full_unstemmed	Using a Stochastic Agent Model to Optimize Performance in Divergent Interest Tacit Coordination Games
title_short	Using a Stochastic Agent Model to Optimize Performance in Divergent Interest Tacit Coordination Games
title_sort	using a stochastic agent model to optimize performance in divergent interest tacit coordination games
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7763831/ https://www.ncbi.nlm.nih.gov/pubmed/33302476 http://dx.doi.org/10.3390/s20247026
work_keys_str_mv	AT mizrahidor usingastochasticagentmodeltooptimizeperformanceindivergentinteresttacitcoordinationgames AT zuckermaninon usingastochasticagentmodeltooptimizeperformanceindivergentinteresttacitcoordinationgames AT lauferilan usingastochasticagentmodeltooptimizeperformanceindivergentinteresttacitcoordinationgames

Using a Stochastic Agent Model to Optimize Performance in Divergent Interest Tacit Coordination Games

Ejemplares similares