Cargando…

Expert-augmented machine learning

Machine learning is proving invaluable across disciplines. However, its success is often limited by the quality and quantity of available data, while its adoption is limited by the level of trust afforded by given models. Human vs. machine performance is commonly compared empirically to decide wheth...

Descripción completa

Detalles Bibliográficos
Autores principales: Gennatas, Efstathios D., Friedman, Jerome H., Ungar, Lyle H., Pirracchio, Romain, Eaton, Eric, Reichmann, Lara G., Interian, Yannet, Luna, José Marcio, Simone, Charles B., Auerbach, Andrew, Delgado, Elier, van der Laan, Mark J., Solberg, Timothy D., Valdes, Gilmer
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7060733/
https://www.ncbi.nlm.nih.gov/pubmed/32071251
http://dx.doi.org/10.1073/pnas.1906831117
_version_ 1783504286194335744
author Gennatas, Efstathios D.
Friedman, Jerome H.
Ungar, Lyle H.
Pirracchio, Romain
Eaton, Eric
Reichmann, Lara G.
Interian, Yannet
Luna, José Marcio
Simone, Charles B.
Auerbach, Andrew
Delgado, Elier
van der Laan, Mark J.
Solberg, Timothy D.
Valdes, Gilmer
author_facet Gennatas, Efstathios D.
Friedman, Jerome H.
Ungar, Lyle H.
Pirracchio, Romain
Eaton, Eric
Reichmann, Lara G.
Interian, Yannet
Luna, José Marcio
Simone, Charles B.
Auerbach, Andrew
Delgado, Elier
van der Laan, Mark J.
Solberg, Timothy D.
Valdes, Gilmer
author_sort Gennatas, Efstathios D.
collection PubMed
description Machine learning is proving invaluable across disciplines. However, its success is often limited by the quality and quantity of available data, while its adoption is limited by the level of trust afforded by given models. Human vs. machine performance is commonly compared empirically to decide whether a certain task should be performed by a computer or an expert. In reality, the optimal learning strategy may involve combining the complementary strengths of humans and machines. Here, we present expert-augmented machine learning (EAML), an automated method that guides the extraction of expert knowledge and its integration into machine-learned models. We used a large dataset of intensive-care patient data to derive 126 decision rules that predict hospital mortality. Using an online platform, we asked 15 clinicians to assess the relative risk of the subpopulation defined by each rule compared to the total sample. We compared the clinician-assessed risk to the empirical risk and found that, while clinicians agreed with the data in most cases, there were notable exceptions where they overestimated or underestimated the true risk. Studying the rules with greatest disagreement, we identified problems with the training data, including one miscoded variable and one hidden confounder. Filtering the rules based on the extent of disagreement between clinician-assessed risk and empirical risk, we improved performance on out-of-sample data and were able to train with less data. EAML provides a platform for automated creation of problem-specific priors, which help build robust and dependable machine-learning models in critical applications.
format Online
Article
Text
id pubmed-7060733
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-70607332020-03-13 Expert-augmented machine learning Gennatas, Efstathios D. Friedman, Jerome H. Ungar, Lyle H. Pirracchio, Romain Eaton, Eric Reichmann, Lara G. Interian, Yannet Luna, José Marcio Simone, Charles B. Auerbach, Andrew Delgado, Elier van der Laan, Mark J. Solberg, Timothy D. Valdes, Gilmer Proc Natl Acad Sci U S A Physical Sciences Machine learning is proving invaluable across disciplines. However, its success is often limited by the quality and quantity of available data, while its adoption is limited by the level of trust afforded by given models. Human vs. machine performance is commonly compared empirically to decide whether a certain task should be performed by a computer or an expert. In reality, the optimal learning strategy may involve combining the complementary strengths of humans and machines. Here, we present expert-augmented machine learning (EAML), an automated method that guides the extraction of expert knowledge and its integration into machine-learned models. We used a large dataset of intensive-care patient data to derive 126 decision rules that predict hospital mortality. Using an online platform, we asked 15 clinicians to assess the relative risk of the subpopulation defined by each rule compared to the total sample. We compared the clinician-assessed risk to the empirical risk and found that, while clinicians agreed with the data in most cases, there were notable exceptions where they overestimated or underestimated the true risk. Studying the rules with greatest disagreement, we identified problems with the training data, including one miscoded variable and one hidden confounder. Filtering the rules based on the extent of disagreement between clinician-assessed risk and empirical risk, we improved performance on out-of-sample data and were able to train with less data. EAML provides a platform for automated creation of problem-specific priors, which help build robust and dependable machine-learning models in critical applications. National Academy of Sciences 2020-03-03 2020-02-18 /pmc/articles/PMC7060733/ /pubmed/32071251 http://dx.doi.org/10.1073/pnas.1906831117 Text en Copyright © 2020 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/ https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Physical Sciences
Gennatas, Efstathios D.
Friedman, Jerome H.
Ungar, Lyle H.
Pirracchio, Romain
Eaton, Eric
Reichmann, Lara G.
Interian, Yannet
Luna, José Marcio
Simone, Charles B.
Auerbach, Andrew
Delgado, Elier
van der Laan, Mark J.
Solberg, Timothy D.
Valdes, Gilmer
Expert-augmented machine learning
title Expert-augmented machine learning
title_full Expert-augmented machine learning
title_fullStr Expert-augmented machine learning
title_full_unstemmed Expert-augmented machine learning
title_short Expert-augmented machine learning
title_sort expert-augmented machine learning
topic Physical Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7060733/
https://www.ncbi.nlm.nih.gov/pubmed/32071251
http://dx.doi.org/10.1073/pnas.1906831117
work_keys_str_mv AT gennatasefstathiosd expertaugmentedmachinelearning
AT friedmanjeromeh expertaugmentedmachinelearning
AT ungarlyleh expertaugmentedmachinelearning
AT pirracchioromain expertaugmentedmachinelearning
AT eatoneric expertaugmentedmachinelearning
AT reichmannlarag expertaugmentedmachinelearning
AT interianyannet expertaugmentedmachinelearning
AT lunajosemarcio expertaugmentedmachinelearning
AT simonecharlesb expertaugmentedmachinelearning
AT auerbachandrew expertaugmentedmachinelearning
AT delgadoelier expertaugmentedmachinelearning
AT vanderlaanmarkj expertaugmentedmachinelearning
AT solbergtimothyd expertaugmentedmachinelearning
AT valdesgilmer expertaugmentedmachinelearning