Cargando…
Comparison and interpretability of machine learning models to predict severity of chest injury
OBJECTIVE: Trauma quality improvement programs and registries improve care and outcomes for injured patients. Designated trauma centers calculate injury scores using dedicated trauma registrars; however, many injuries arrive at nontrauma centers, leaving a substantial amount of data uncaptured. We p...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7935500/ https://www.ncbi.nlm.nih.gov/pubmed/33709067 http://dx.doi.org/10.1093/jamiaopen/ooab015 |
_version_ | 1783661009452400640 |
---|---|
author | Kulshrestha, Sujay Dligach, Dmitriy Joyce, Cara Gonzalez, Richard O’Rourke, Ann P Glazer, Joshua M Stey, Anne Kruser, Jacqueline M Churpek, Matthew M Afshar, Majid |
author_facet | Kulshrestha, Sujay Dligach, Dmitriy Joyce, Cara Gonzalez, Richard O’Rourke, Ann P Glazer, Joshua M Stey, Anne Kruser, Jacqueline M Churpek, Matthew M Afshar, Majid |
author_sort | Kulshrestha, Sujay |
collection | PubMed |
description | OBJECTIVE: Trauma quality improvement programs and registries improve care and outcomes for injured patients. Designated trauma centers calculate injury scores using dedicated trauma registrars; however, many injuries arrive at nontrauma centers, leaving a substantial amount of data uncaptured. We propose automated methods to identify severe chest injury using machine learning (ML) and natural language processing (NLP) methods from the electronic health record (EHR) for quality reporting. MATERIALS AND METHODS: A level I trauma center was queried for patients presenting after injury between 2014 and 2018. Prediction modeling was performed to classify severe chest injury using a reference dataset labeled by certified registrars. Clinical documents from trauma encounters were processed into concept unique identifiers for inputs to ML models: logistic regression with elastic net (EN) regularization, extreme gradient boosted (XGB) machines, and convolutional neural networks (CNN). The optimal model was identified by examining predictive and face validity metrics using global explanations. RESULTS: Of 8952 encounters, 542 (6.1%) had a severe chest injury. CNN and EN had the highest discrimination, with an area under the receiver operating characteristic curve of 0.93 and calibration slopes between 0.88 and 0.97. CNN had better performance across risk thresholds with fewer discordant cases. Examination of global explanations demonstrated the CNN model had better face validity, with top features including “contusion of lung” and “hemopneumothorax.” DISCUSSION: The CNN model featured optimal discrimination, calibration, and clinically relevant features selected. CONCLUSION: NLP and ML methods to populate trauma registries for quality analyses are feasible. |
format | Online Article Text |
id | pubmed-7935500 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-79355002021-03-10 Comparison and interpretability of machine learning models to predict severity of chest injury Kulshrestha, Sujay Dligach, Dmitriy Joyce, Cara Gonzalez, Richard O’Rourke, Ann P Glazer, Joshua M Stey, Anne Kruser, Jacqueline M Churpek, Matthew M Afshar, Majid JAMIA Open Research and Applications OBJECTIVE: Trauma quality improvement programs and registries improve care and outcomes for injured patients. Designated trauma centers calculate injury scores using dedicated trauma registrars; however, many injuries arrive at nontrauma centers, leaving a substantial amount of data uncaptured. We propose automated methods to identify severe chest injury using machine learning (ML) and natural language processing (NLP) methods from the electronic health record (EHR) for quality reporting. MATERIALS AND METHODS: A level I trauma center was queried for patients presenting after injury between 2014 and 2018. Prediction modeling was performed to classify severe chest injury using a reference dataset labeled by certified registrars. Clinical documents from trauma encounters were processed into concept unique identifiers for inputs to ML models: logistic regression with elastic net (EN) regularization, extreme gradient boosted (XGB) machines, and convolutional neural networks (CNN). The optimal model was identified by examining predictive and face validity metrics using global explanations. RESULTS: Of 8952 encounters, 542 (6.1%) had a severe chest injury. CNN and EN had the highest discrimination, with an area under the receiver operating characteristic curve of 0.93 and calibration slopes between 0.88 and 0.97. CNN had better performance across risk thresholds with fewer discordant cases. Examination of global explanations demonstrated the CNN model had better face validity, with top features including “contusion of lung” and “hemopneumothorax.” DISCUSSION: The CNN model featured optimal discrimination, calibration, and clinically relevant features selected. CONCLUSION: NLP and ML methods to populate trauma registries for quality analyses are feasible. Oxford University Press 2021-03-01 /pmc/articles/PMC7935500/ /pubmed/33709067 http://dx.doi.org/10.1093/jamiaopen/ooab015 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Research and Applications Kulshrestha, Sujay Dligach, Dmitriy Joyce, Cara Gonzalez, Richard O’Rourke, Ann P Glazer, Joshua M Stey, Anne Kruser, Jacqueline M Churpek, Matthew M Afshar, Majid Comparison and interpretability of machine learning models to predict severity of chest injury |
title | Comparison and interpretability of machine learning models to predict severity of chest injury |
title_full | Comparison and interpretability of machine learning models to predict severity of chest injury |
title_fullStr | Comparison and interpretability of machine learning models to predict severity of chest injury |
title_full_unstemmed | Comparison and interpretability of machine learning models to predict severity of chest injury |
title_short | Comparison and interpretability of machine learning models to predict severity of chest injury |
title_sort | comparison and interpretability of machine learning models to predict severity of chest injury |
topic | Research and Applications |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7935500/ https://www.ncbi.nlm.nih.gov/pubmed/33709067 http://dx.doi.org/10.1093/jamiaopen/ooab015 |
work_keys_str_mv | AT kulshresthasujay comparisonandinterpretabilityofmachinelearningmodelstopredictseverityofchestinjury AT dligachdmitriy comparisonandinterpretabilityofmachinelearningmodelstopredictseverityofchestinjury AT joycecara comparisonandinterpretabilityofmachinelearningmodelstopredictseverityofchestinjury AT gonzalezrichard comparisonandinterpretabilityofmachinelearningmodelstopredictseverityofchestinjury AT orourkeannp comparisonandinterpretabilityofmachinelearningmodelstopredictseverityofchestinjury AT glazerjoshuam comparisonandinterpretabilityofmachinelearningmodelstopredictseverityofchestinjury AT steyanne comparisonandinterpretabilityofmachinelearningmodelstopredictseverityofchestinjury AT kruserjacquelinem comparisonandinterpretabilityofmachinelearningmodelstopredictseverityofchestinjury AT churpekmatthewm comparisonandinterpretabilityofmachinelearningmodelstopredictseverityofchestinjury AT afsharmajid comparisonandinterpretabilityofmachinelearningmodelstopredictseverityofchestinjury |