Cargando…

Deep learning predicts hip fracture using confounding patient and healthcare variables

Hip fractures are a leading cause of death and disability among older adults. Hip fractures are also the most commonly missed diagnosis on pelvic radiographs, and delayed diagnosis leads to higher cost and worse outcomes. Computer-aided diagnosis (CAD) algorithms have shown promise for helping radio...

Descripción completa

Detalles Bibliográficos
Autores principales: Badgeley, Marcus A., Zech, John R., Oakden-Rayner, Luke, Glicksberg, Benjamin S., Liu, Manway, Gale, William, McConnell, Michael V., Percha, Bethany, Snyder, Thomas M., Dudley, Joel T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6550136/
https://www.ncbi.nlm.nih.gov/pubmed/31304378
http://dx.doi.org/10.1038/s41746-019-0105-1
_version_ 1783424135165116416
author Badgeley, Marcus A.
Zech, John R.
Oakden-Rayner, Luke
Glicksberg, Benjamin S.
Liu, Manway
Gale, William
McConnell, Michael V.
Percha, Bethany
Snyder, Thomas M.
Dudley, Joel T.
author_facet Badgeley, Marcus A.
Zech, John R.
Oakden-Rayner, Luke
Glicksberg, Benjamin S.
Liu, Manway
Gale, William
McConnell, Michael V.
Percha, Bethany
Snyder, Thomas M.
Dudley, Joel T.
author_sort Badgeley, Marcus A.
collection PubMed
description Hip fractures are a leading cause of death and disability among older adults. Hip fractures are also the most commonly missed diagnosis on pelvic radiographs, and delayed diagnosis leads to higher cost and worse outcomes. Computer-aided diagnosis (CAD) algorithms have shown promise for helping radiologists detect fractures, but the image features underpinning their predictions are notoriously difficult to understand. In this study, we trained deep-learning models on 17,587 radiographs to classify fracture, 5 patient traits, and 14 hospital process variables. All 20 variables could be individually predicted from a radiograph, with the best performances on scanner model (AUC = 1.00), scanner brand (AUC = 0.98), and whether the order was marked “priority” (AUC = 0.79). Fracture was predicted moderately well from the image (AUC = 0.78) and better when combining image features with patient data (AUC = 0.86, DeLong paired AUC comparison, p = 2e-9) or patient data plus hospital process features (AUC = 0.91, p = 1e-21). Fracture prediction on a test set that balanced fracture risk across patient variables was significantly lower than a random test set (AUC = 0.67, DeLong unpaired AUC comparison, p = 0.003); and on a test set with fracture risk balanced across patient and hospital process variables, the model performed randomly (AUC = 0.52, 95% CI 0.46–0.58), indicating that these variables were the main source of the model’s fracture predictions. A single model that directly combines image features, patient, and hospital process data outperforms a Naive Bayes ensemble of an image-only model prediction, patient, and hospital process data. If CAD algorithms are inexplicably leveraging patient and process variables in their predictions, it is unclear how radiologists should interpret their predictions in the context of other known patient data. Further research is needed to illuminate deep-learning decision processes so that computers and clinicians can effectively cooperate.
format Online
Article
Text
id pubmed-6550136
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-65501362019-07-12 Deep learning predicts hip fracture using confounding patient and healthcare variables Badgeley, Marcus A. Zech, John R. Oakden-Rayner, Luke Glicksberg, Benjamin S. Liu, Manway Gale, William McConnell, Michael V. Percha, Bethany Snyder, Thomas M. Dudley, Joel T. NPJ Digit Med Article Hip fractures are a leading cause of death and disability among older adults. Hip fractures are also the most commonly missed diagnosis on pelvic radiographs, and delayed diagnosis leads to higher cost and worse outcomes. Computer-aided diagnosis (CAD) algorithms have shown promise for helping radiologists detect fractures, but the image features underpinning their predictions are notoriously difficult to understand. In this study, we trained deep-learning models on 17,587 radiographs to classify fracture, 5 patient traits, and 14 hospital process variables. All 20 variables could be individually predicted from a radiograph, with the best performances on scanner model (AUC = 1.00), scanner brand (AUC = 0.98), and whether the order was marked “priority” (AUC = 0.79). Fracture was predicted moderately well from the image (AUC = 0.78) and better when combining image features with patient data (AUC = 0.86, DeLong paired AUC comparison, p = 2e-9) or patient data plus hospital process features (AUC = 0.91, p = 1e-21). Fracture prediction on a test set that balanced fracture risk across patient variables was significantly lower than a random test set (AUC = 0.67, DeLong unpaired AUC comparison, p = 0.003); and on a test set with fracture risk balanced across patient and hospital process variables, the model performed randomly (AUC = 0.52, 95% CI 0.46–0.58), indicating that these variables were the main source of the model’s fracture predictions. A single model that directly combines image features, patient, and hospital process data outperforms a Naive Bayes ensemble of an image-only model prediction, patient, and hospital process data. If CAD algorithms are inexplicably leveraging patient and process variables in their predictions, it is unclear how radiologists should interpret their predictions in the context of other known patient data. Further research is needed to illuminate deep-learning decision processes so that computers and clinicians can effectively cooperate. Nature Publishing Group UK 2019-04-30 /pmc/articles/PMC6550136/ /pubmed/31304378 http://dx.doi.org/10.1038/s41746-019-0105-1 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Badgeley, Marcus A.
Zech, John R.
Oakden-Rayner, Luke
Glicksberg, Benjamin S.
Liu, Manway
Gale, William
McConnell, Michael V.
Percha, Bethany
Snyder, Thomas M.
Dudley, Joel T.
Deep learning predicts hip fracture using confounding patient and healthcare variables
title Deep learning predicts hip fracture using confounding patient and healthcare variables
title_full Deep learning predicts hip fracture using confounding patient and healthcare variables
title_fullStr Deep learning predicts hip fracture using confounding patient and healthcare variables
title_full_unstemmed Deep learning predicts hip fracture using confounding patient and healthcare variables
title_short Deep learning predicts hip fracture using confounding patient and healthcare variables
title_sort deep learning predicts hip fracture using confounding patient and healthcare variables
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6550136/
https://www.ncbi.nlm.nih.gov/pubmed/31304378
http://dx.doi.org/10.1038/s41746-019-0105-1
work_keys_str_mv AT badgeleymarcusa deeplearningpredictshipfractureusingconfoundingpatientandhealthcarevariables
AT zechjohnr deeplearningpredictshipfractureusingconfoundingpatientandhealthcarevariables
AT oakdenraynerluke deeplearningpredictshipfractureusingconfoundingpatientandhealthcarevariables
AT glicksbergbenjamins deeplearningpredictshipfractureusingconfoundingpatientandhealthcarevariables
AT liumanway deeplearningpredictshipfractureusingconfoundingpatientandhealthcarevariables
AT galewilliam deeplearningpredictshipfractureusingconfoundingpatientandhealthcarevariables
AT mcconnellmichaelv deeplearningpredictshipfractureusingconfoundingpatientandhealthcarevariables
AT perchabethany deeplearningpredictshipfractureusingconfoundingpatientandhealthcarevariables
AT snyderthomasm deeplearningpredictshipfractureusingconfoundingpatientandhealthcarevariables
AT dudleyjoelt deeplearningpredictshipfractureusingconfoundingpatientandhealthcarevariables