Cargando…

Bayesian modeling of human–AI complementarity

Artificial intelligence (AI) and machine learning models are being increasingly deployed in real-world applications. In many of these applications, there is strong motivation to develop hybrid systems in which humans and AI algorithms can work together, leveraging their complementary strengths and w...

Descripción completa

Detalles Bibliográficos
Autores principales: Steyvers, Mark, Tejeda, Heliodoro, Kerrigan, Gavin, Smyth, Padhraic
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8931210/
https://www.ncbi.nlm.nih.gov/pubmed/35275788
http://dx.doi.org/10.1073/pnas.2111547119
_version_ 1784671206881886208
author Steyvers, Mark
Tejeda, Heliodoro
Kerrigan, Gavin
Smyth, Padhraic
author_facet Steyvers, Mark
Tejeda, Heliodoro
Kerrigan, Gavin
Smyth, Padhraic
author_sort Steyvers, Mark
collection PubMed
description Artificial intelligence (AI) and machine learning models are being increasingly deployed in real-world applications. In many of these applications, there is strong motivation to develop hybrid systems in which humans and AI algorithms can work together, leveraging their complementary strengths and weaknesses. We develop a Bayesian framework for combining the predictions and different types of confidence scores from humans and machines. The framework allows us to investigate the factors that influence complementarity, where a hybrid combination of human and machine predictions leads to better performance than combinations of human or machine predictions alone. We apply this framework to a large-scale dataset where humans and a variety of convolutional neural networks perform the same challenging image classification task. We show empirically and theoretically that complementarity can be achieved even if the human and machine classifiers perform at different accuracy levels as long as these accuracy differences fall within a bound determined by the latent correlation between human and machine classifier confidence scores. In addition, we demonstrate that hybrid human–machine performance can be improved by differentiating between the errors that humans and machine classifiers make across different class labels. Finally, our results show that eliciting and including human confidence ratings improve hybrid performance in the Bayesian combination model. Our approach is applicable to a wide variety of classification problems involving human and machine algorithms.
format Online
Article
Text
id pubmed-8931210
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-89312102022-03-19 Bayesian modeling of human–AI complementarity Steyvers, Mark Tejeda, Heliodoro Kerrigan, Gavin Smyth, Padhraic Proc Natl Acad Sci U S A Social Sciences Artificial intelligence (AI) and machine learning models are being increasingly deployed in real-world applications. In many of these applications, there is strong motivation to develop hybrid systems in which humans and AI algorithms can work together, leveraging their complementary strengths and weaknesses. We develop a Bayesian framework for combining the predictions and different types of confidence scores from humans and machines. The framework allows us to investigate the factors that influence complementarity, where a hybrid combination of human and machine predictions leads to better performance than combinations of human or machine predictions alone. We apply this framework to a large-scale dataset where humans and a variety of convolutional neural networks perform the same challenging image classification task. We show empirically and theoretically that complementarity can be achieved even if the human and machine classifiers perform at different accuracy levels as long as these accuracy differences fall within a bound determined by the latent correlation between human and machine classifier confidence scores. In addition, we demonstrate that hybrid human–machine performance can be improved by differentiating between the errors that humans and machine classifiers make across different class labels. Finally, our results show that eliciting and including human confidence ratings improve hybrid performance in the Bayesian combination model. Our approach is applicable to a wide variety of classification problems involving human and machine algorithms. National Academy of Sciences 2022-03-11 2022-03-15 /pmc/articles/PMC8931210/ /pubmed/35275788 http://dx.doi.org/10.1073/pnas.2111547119 Text en Copyright © 2022 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Social Sciences
Steyvers, Mark
Tejeda, Heliodoro
Kerrigan, Gavin
Smyth, Padhraic
Bayesian modeling of human–AI complementarity
title Bayesian modeling of human–AI complementarity
title_full Bayesian modeling of human–AI complementarity
title_fullStr Bayesian modeling of human–AI complementarity
title_full_unstemmed Bayesian modeling of human–AI complementarity
title_short Bayesian modeling of human–AI complementarity
title_sort bayesian modeling of human–ai complementarity
topic Social Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8931210/
https://www.ncbi.nlm.nih.gov/pubmed/35275788
http://dx.doi.org/10.1073/pnas.2111547119
work_keys_str_mv AT steyversmark bayesianmodelingofhumanaicomplementarity
AT tejedaheliodoro bayesianmodelingofhumanaicomplementarity
AT kerrigangavin bayesianmodelingofhumanaicomplementarity
AT smythpadhraic bayesianmodelingofhumanaicomplementarity