Cargando…
Expert-augmented machine learning
Machine learning is proving invaluable across disciplines. However, its success is often limited by the quality and quantity of available data, while its adoption is limited by the level of trust afforded by given models. Human vs. machine performance is commonly compared empirically to decide wheth...
Autores principales: | , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Academy of Sciences
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7060733/ https://www.ncbi.nlm.nih.gov/pubmed/32071251 http://dx.doi.org/10.1073/pnas.1906831117 |
_version_ | 1783504286194335744 |
---|---|
author | Gennatas, Efstathios D. Friedman, Jerome H. Ungar, Lyle H. Pirracchio, Romain Eaton, Eric Reichmann, Lara G. Interian, Yannet Luna, José Marcio Simone, Charles B. Auerbach, Andrew Delgado, Elier van der Laan, Mark J. Solberg, Timothy D. Valdes, Gilmer |
author_facet | Gennatas, Efstathios D. Friedman, Jerome H. Ungar, Lyle H. Pirracchio, Romain Eaton, Eric Reichmann, Lara G. Interian, Yannet Luna, José Marcio Simone, Charles B. Auerbach, Andrew Delgado, Elier van der Laan, Mark J. Solberg, Timothy D. Valdes, Gilmer |
author_sort | Gennatas, Efstathios D. |
collection | PubMed |
description | Machine learning is proving invaluable across disciplines. However, its success is often limited by the quality and quantity of available data, while its adoption is limited by the level of trust afforded by given models. Human vs. machine performance is commonly compared empirically to decide whether a certain task should be performed by a computer or an expert. In reality, the optimal learning strategy may involve combining the complementary strengths of humans and machines. Here, we present expert-augmented machine learning (EAML), an automated method that guides the extraction of expert knowledge and its integration into machine-learned models. We used a large dataset of intensive-care patient data to derive 126 decision rules that predict hospital mortality. Using an online platform, we asked 15 clinicians to assess the relative risk of the subpopulation defined by each rule compared to the total sample. We compared the clinician-assessed risk to the empirical risk and found that, while clinicians agreed with the data in most cases, there were notable exceptions where they overestimated or underestimated the true risk. Studying the rules with greatest disagreement, we identified problems with the training data, including one miscoded variable and one hidden confounder. Filtering the rules based on the extent of disagreement between clinician-assessed risk and empirical risk, we improved performance on out-of-sample data and were able to train with less data. EAML provides a platform for automated creation of problem-specific priors, which help build robust and dependable machine-learning models in critical applications. |
format | Online Article Text |
id | pubmed-7060733 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | National Academy of Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-70607332020-03-13 Expert-augmented machine learning Gennatas, Efstathios D. Friedman, Jerome H. Ungar, Lyle H. Pirracchio, Romain Eaton, Eric Reichmann, Lara G. Interian, Yannet Luna, José Marcio Simone, Charles B. Auerbach, Andrew Delgado, Elier van der Laan, Mark J. Solberg, Timothy D. Valdes, Gilmer Proc Natl Acad Sci U S A Physical Sciences Machine learning is proving invaluable across disciplines. However, its success is often limited by the quality and quantity of available data, while its adoption is limited by the level of trust afforded by given models. Human vs. machine performance is commonly compared empirically to decide whether a certain task should be performed by a computer or an expert. In reality, the optimal learning strategy may involve combining the complementary strengths of humans and machines. Here, we present expert-augmented machine learning (EAML), an automated method that guides the extraction of expert knowledge and its integration into machine-learned models. We used a large dataset of intensive-care patient data to derive 126 decision rules that predict hospital mortality. Using an online platform, we asked 15 clinicians to assess the relative risk of the subpopulation defined by each rule compared to the total sample. We compared the clinician-assessed risk to the empirical risk and found that, while clinicians agreed with the data in most cases, there were notable exceptions where they overestimated or underestimated the true risk. Studying the rules with greatest disagreement, we identified problems with the training data, including one miscoded variable and one hidden confounder. Filtering the rules based on the extent of disagreement between clinician-assessed risk and empirical risk, we improved performance on out-of-sample data and were able to train with less data. EAML provides a platform for automated creation of problem-specific priors, which help build robust and dependable machine-learning models in critical applications. National Academy of Sciences 2020-03-03 2020-02-18 /pmc/articles/PMC7060733/ /pubmed/32071251 http://dx.doi.org/10.1073/pnas.1906831117 Text en Copyright © 2020 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/ https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Physical Sciences Gennatas, Efstathios D. Friedman, Jerome H. Ungar, Lyle H. Pirracchio, Romain Eaton, Eric Reichmann, Lara G. Interian, Yannet Luna, José Marcio Simone, Charles B. Auerbach, Andrew Delgado, Elier van der Laan, Mark J. Solberg, Timothy D. Valdes, Gilmer Expert-augmented machine learning |
title | Expert-augmented machine learning |
title_full | Expert-augmented machine learning |
title_fullStr | Expert-augmented machine learning |
title_full_unstemmed | Expert-augmented machine learning |
title_short | Expert-augmented machine learning |
title_sort | expert-augmented machine learning |
topic | Physical Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7060733/ https://www.ncbi.nlm.nih.gov/pubmed/32071251 http://dx.doi.org/10.1073/pnas.1906831117 |
work_keys_str_mv | AT gennatasefstathiosd expertaugmentedmachinelearning AT friedmanjeromeh expertaugmentedmachinelearning AT ungarlyleh expertaugmentedmachinelearning AT pirracchioromain expertaugmentedmachinelearning AT eatoneric expertaugmentedmachinelearning AT reichmannlarag expertaugmentedmachinelearning AT interianyannet expertaugmentedmachinelearning AT lunajosemarcio expertaugmentedmachinelearning AT simonecharlesb expertaugmentedmachinelearning AT auerbachandrew expertaugmentedmachinelearning AT delgadoelier expertaugmentedmachinelearning AT vanderlaanmarkj expertaugmentedmachinelearning AT solbergtimothyd expertaugmentedmachinelearning AT valdesgilmer expertaugmentedmachinelearning |