Cargando…
Bayesian modeling of human–AI complementarity
Artificial intelligence (AI) and machine learning models are being increasingly deployed in real-world applications. In many of these applications, there is strong motivation to develop hybrid systems in which humans and AI algorithms can work together, leveraging their complementary strengths and w...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Academy of Sciences
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8931210/ https://www.ncbi.nlm.nih.gov/pubmed/35275788 http://dx.doi.org/10.1073/pnas.2111547119 |
_version_ | 1784671206881886208 |
---|---|
author | Steyvers, Mark Tejeda, Heliodoro Kerrigan, Gavin Smyth, Padhraic |
author_facet | Steyvers, Mark Tejeda, Heliodoro Kerrigan, Gavin Smyth, Padhraic |
author_sort | Steyvers, Mark |
collection | PubMed |
description | Artificial intelligence (AI) and machine learning models are being increasingly deployed in real-world applications. In many of these applications, there is strong motivation to develop hybrid systems in which humans and AI algorithms can work together, leveraging their complementary strengths and weaknesses. We develop a Bayesian framework for combining the predictions and different types of confidence scores from humans and machines. The framework allows us to investigate the factors that influence complementarity, where a hybrid combination of human and machine predictions leads to better performance than combinations of human or machine predictions alone. We apply this framework to a large-scale dataset where humans and a variety of convolutional neural networks perform the same challenging image classification task. We show empirically and theoretically that complementarity can be achieved even if the human and machine classifiers perform at different accuracy levels as long as these accuracy differences fall within a bound determined by the latent correlation between human and machine classifier confidence scores. In addition, we demonstrate that hybrid human–machine performance can be improved by differentiating between the errors that humans and machine classifiers make across different class labels. Finally, our results show that eliciting and including human confidence ratings improve hybrid performance in the Bayesian combination model. Our approach is applicable to a wide variety of classification problems involving human and machine algorithms. |
format | Online Article Text |
id | pubmed-8931210 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | National Academy of Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-89312102022-03-19 Bayesian modeling of human–AI complementarity Steyvers, Mark Tejeda, Heliodoro Kerrigan, Gavin Smyth, Padhraic Proc Natl Acad Sci U S A Social Sciences Artificial intelligence (AI) and machine learning models are being increasingly deployed in real-world applications. In many of these applications, there is strong motivation to develop hybrid systems in which humans and AI algorithms can work together, leveraging their complementary strengths and weaknesses. We develop a Bayesian framework for combining the predictions and different types of confidence scores from humans and machines. The framework allows us to investigate the factors that influence complementarity, where a hybrid combination of human and machine predictions leads to better performance than combinations of human or machine predictions alone. We apply this framework to a large-scale dataset where humans and a variety of convolutional neural networks perform the same challenging image classification task. We show empirically and theoretically that complementarity can be achieved even if the human and machine classifiers perform at different accuracy levels as long as these accuracy differences fall within a bound determined by the latent correlation between human and machine classifier confidence scores. In addition, we demonstrate that hybrid human–machine performance can be improved by differentiating between the errors that humans and machine classifiers make across different class labels. Finally, our results show that eliciting and including human confidence ratings improve hybrid performance in the Bayesian combination model. Our approach is applicable to a wide variety of classification problems involving human and machine algorithms. National Academy of Sciences 2022-03-11 2022-03-15 /pmc/articles/PMC8931210/ /pubmed/35275788 http://dx.doi.org/10.1073/pnas.2111547119 Text en Copyright © 2022 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Social Sciences Steyvers, Mark Tejeda, Heliodoro Kerrigan, Gavin Smyth, Padhraic Bayesian modeling of human–AI complementarity |
title | Bayesian modeling of human–AI complementarity |
title_full | Bayesian modeling of human–AI complementarity |
title_fullStr | Bayesian modeling of human–AI complementarity |
title_full_unstemmed | Bayesian modeling of human–AI complementarity |
title_short | Bayesian modeling of human–AI complementarity |
title_sort | bayesian modeling of human–ai complementarity |
topic | Social Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8931210/ https://www.ncbi.nlm.nih.gov/pubmed/35275788 http://dx.doi.org/10.1073/pnas.2111547119 |
work_keys_str_mv | AT steyversmark bayesianmodelingofhumanaicomplementarity AT tejedaheliodoro bayesianmodelingofhumanaicomplementarity AT kerrigangavin bayesianmodelingofhumanaicomplementarity AT smythpadhraic bayesianmodelingofhumanaicomplementarity |