Cargando…

Automatically explaining machine learning prediction results: a demonstration on type 2 diabetes risk prediction

BACKGROUND: Predictive modeling is a key component of solutions to many healthcare problems. Among all predictive modeling approaches, machine learning methods often achieve the highest prediction accuracy, but suffer from a long-standing open problem precluding their widespread use in healthcare. M...

Descripción completa

Detalles Bibliográficos
Autor principal: Luo, Gang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4782293/
https://www.ncbi.nlm.nih.gov/pubmed/26958341
http://dx.doi.org/10.1186/s13755-016-0015-4
_version_ 1782419922618417152
author Luo, Gang
author_facet Luo, Gang
author_sort Luo, Gang
collection PubMed
description BACKGROUND: Predictive modeling is a key component of solutions to many healthcare problems. Among all predictive modeling approaches, machine learning methods often achieve the highest prediction accuracy, but suffer from a long-standing open problem precluding their widespread use in healthcare. Most machine learning models give no explanation for their prediction results, whereas interpretability is essential for a predictive model to be adopted in typical healthcare settings. METHODS: This paper presents the first complete method for automatically explaining results for any machine learning predictive model without degrading accuracy. We did a computer coding implementation of the method. Using the electronic medical record data set from the Practice Fusion diabetes classification competition containing patient records from all 50 states in the United States, we demonstrated the method on predicting type 2 diabetes diagnosis within the next year. RESULTS: For the champion machine learning model of the competition, our method explained prediction results for 87.4 % of patients who were correctly predicted by the model to have type 2 diabetes diagnosis within the next year. CONCLUSIONS: Our demonstration showed the feasibility of automatically explaining results for any machine learning predictive model without degrading accuracy.
format Online
Article
Text
id pubmed-4782293
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47822932016-03-09 Automatically explaining machine learning prediction results: a demonstration on type 2 diabetes risk prediction Luo, Gang Health Inf Sci Syst Research BACKGROUND: Predictive modeling is a key component of solutions to many healthcare problems. Among all predictive modeling approaches, machine learning methods often achieve the highest prediction accuracy, but suffer from a long-standing open problem precluding their widespread use in healthcare. Most machine learning models give no explanation for their prediction results, whereas interpretability is essential for a predictive model to be adopted in typical healthcare settings. METHODS: This paper presents the first complete method for automatically explaining results for any machine learning predictive model without degrading accuracy. We did a computer coding implementation of the method. Using the electronic medical record data set from the Practice Fusion diabetes classification competition containing patient records from all 50 states in the United States, we demonstrated the method on predicting type 2 diabetes diagnosis within the next year. RESULTS: For the champion machine learning model of the competition, our method explained prediction results for 87.4 % of patients who were correctly predicted by the model to have type 2 diabetes diagnosis within the next year. CONCLUSIONS: Our demonstration showed the feasibility of automatically explaining results for any machine learning predictive model without degrading accuracy. BioMed Central 2016-03-08 /pmc/articles/PMC4782293/ /pubmed/26958341 http://dx.doi.org/10.1186/s13755-016-0015-4 Text en © Luo. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Luo, Gang
Automatically explaining machine learning prediction results: a demonstration on type 2 diabetes risk prediction
title Automatically explaining machine learning prediction results: a demonstration on type 2 diabetes risk prediction
title_full Automatically explaining machine learning prediction results: a demonstration on type 2 diabetes risk prediction
title_fullStr Automatically explaining machine learning prediction results: a demonstration on type 2 diabetes risk prediction
title_full_unstemmed Automatically explaining machine learning prediction results: a demonstration on type 2 diabetes risk prediction
title_short Automatically explaining machine learning prediction results: a demonstration on type 2 diabetes risk prediction
title_sort automatically explaining machine learning prediction results: a demonstration on type 2 diabetes risk prediction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4782293/
https://www.ncbi.nlm.nih.gov/pubmed/26958341
http://dx.doi.org/10.1186/s13755-016-0015-4
work_keys_str_mv AT luogang automaticallyexplainingmachinelearningpredictionresultsademonstrationontype2diabetesriskprediction