Signal detection models as contextual bandits

Signal detection theory (SDT) has been widely applied to identify the optimal discriminative decisions of receivers under uncertainty. However, the approach assumes that decision-makers immediately adopt the appropriate acceptance threshold, even though the optimal response must often be learned. He...

Bibliographic Details
Main Authors:	Sherratt, Thomas N., O'Neill, Erica
Format:	Online Article Text
Language:	English
Published:	The Royal Society 2023
Subjects:	Organismal and Evolutionary Biology
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10282591/ https://www.ncbi.nlm.nih.gov/pubmed/37351497 http://dx.doi.org/10.1098/rsos.230157

author	Sherratt, Thomas N. O'Neill, Erica
collection	PubMed
description	Signal detection theory (SDT) has been widely applied to identify the optimal discriminative decisions of receivers under uncertainty. However, the approach assumes that decision-makers immediately adopt the appropriate acceptance threshold, even though the optimal response must often be learned. Here we recast the classical normal–normal (and power-law) signal detection model as a contextual multi-armed bandit (CMAB). Thus, rather than starting with complete information, decision-makers must infer how the magnitude of a continuous cue is related to the probability that a signaller is desirable, while simultaneously seeking to exploit the information they acquire. We explain how various CMAB heuristics resolve the trade-off between better estimating the underlying relationship and exploiting it. Next, we determined how naive human volunteers resolve signal detection problems with a continuous cue. As anticipated, a model of choice (accept/reject) that assumed volunteers immediately adopted the SDT-predicted acceptance threshold did not predict volunteer behaviour well. The Softmax rule for solving CMABs, with choices based on a logistic function of the expected payoffs, best explained the decisions of our volunteers but a simple midpoint algorithm also predicted decisions well under some conditions. CMABs offer principled parametric solutions to solving many classical SDT problems when decision-makers start with incomplete information.
format	Online Article Text
id	pubmed-10282591
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	The Royal Society
record_format	MEDLINE/PubMed
spelling	pubmed-10282591 2023-06-22 R Soc Open Sci. The Royal Society 2023-06-21. Text en. © 2023 The Authors. Published by the Royal Society under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, provided the original author and source are credited.
title	Signal detection models as contextual bandits
topic	Organismal and Evolutionary Biology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10282591/ https://www.ncbi.nlm.nih.gov/pubmed/37351497 http://dx.doi.org/10.1098/rsos.230157
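
The description above contrasts a fixed SDT acceptance threshold with a Softmax rule in which the probability of accepting is a logistic function of the expected payoffs. The following Python sketch illustrates that contrast for an equal-variance normal-normal model. It is not the authors' implementation: the function names, the payoff structure (a benefit for accepting a desirable signaller, a cost for accepting an undesirable one, zero for rejecting), the `temperature` parameter and all numeric values are assumptions made for illustration. It also assumes the cue distributions are known, whereas a learner in the bandit setting would have to estimate them from experience.

```python
import math


def normal_pdf(x, mean, sd):
    """Density of a normal distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def prob_desirable(x, mu_u, mu_d, sd, p_d):
    """Posterior probability that a signaller with cue x is desirable.

    Assumes undesirable cues ~ N(mu_u, sd), desirable cues ~ N(mu_d, sd),
    and a prior probability p_d of meeting a desirable signaller.
    """
    like_d = p_d * normal_pdf(x, mu_d, sd)
    like_u = (1.0 - p_d) * normal_pdf(x, mu_u, sd)
    return like_d / (like_d + like_u)


def expected_payoff_accept(x, mu_u, mu_d, sd, p_d, benefit, cost):
    """Expected payoff of accepting; rejecting is assumed to pay zero."""
    p = prob_desirable(x, mu_u, mu_d, sd, p_d)
    return p * benefit - (1.0 - p) * cost


def sdt_threshold(mu_u, mu_d, sd, p_d, benefit, cost):
    """Cue value at which accepting and rejecting pay equally (mu_d > mu_u)."""
    midpoint = 0.5 * (mu_u + mu_d)
    log_ratio = math.log(((1.0 - p_d) * cost) / (p_d * benefit))
    return midpoint + (sd ** 2 / (mu_d - mu_u)) * log_ratio


def softmax_accept_prob(ev_accept, ev_reject=0.0, temperature=1.0):
    """Two-option Softmax: a logistic function of the payoff difference."""
    return 1.0 / (1.0 + math.exp(-(ev_accept - ev_reject) / temperature))


if __name__ == "__main__":
    # Hypothetical parameters chosen only to exercise the functions.
    params = dict(mu_u=0.0, mu_d=2.0, sd=1.0, p_d=0.5, benefit=1.0, cost=2.0)
    x_star = sdt_threshold(**params)
    for x in (0.0, x_star, 3.0):
        ev = expected_payoff_accept(x, **params)
        print(f"cue={x:.3f}  EV(accept)={ev:+.3f}  "
              f"P(accept)={softmax_accept_prob(ev, temperature=0.25):.3f}")
```

Under the threshold rule every cue above `x_star` is accepted and every cue below it is rejected. Under the Softmax rule the acceptance probability rises smoothly through 0.5 at `x_star`, and a larger `temperature` makes choices more exploratory. In a learning setting, `ev_accept` would be replaced by the decision-maker's current estimate of the expected payoff.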
