Cargando…

Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications

OBJECTIVE: To compare performance of risk prediction models for forecasting postoperative sepsis and acute kidney injury. DESIGN: Retrospective single center cohort study of adult surgical patients admitted between 2000 and 2010. PATIENTS: 50,318 adult patients undergoing major surgery. MEASUREMENTS...

Descripción completa

Detalles Bibliográficos
Autores principales: Thottakkara, Paul, Ozrazgat-Baslanti, Tezcan, Hupf, Bradley B., Rashidi, Parisa, Pardalos, Panos, Momcilovic, Petar, Bihorac, Azra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4883761/
https://www.ncbi.nlm.nih.gov/pubmed/27232332
http://dx.doi.org/10.1371/journal.pone.0155705
_version_ 1782434298710720512
author Thottakkara, Paul
Ozrazgat-Baslanti, Tezcan
Hupf, Bradley B.
Rashidi, Parisa
Pardalos, Panos
Momcilovic, Petar
Bihorac, Azra
author_facet Thottakkara, Paul
Ozrazgat-Baslanti, Tezcan
Hupf, Bradley B.
Rashidi, Parisa
Pardalos, Panos
Momcilovic, Petar
Bihorac, Azra
author_sort Thottakkara, Paul
collection PubMed
description OBJECTIVE: To compare performance of risk prediction models for forecasting postoperative sepsis and acute kidney injury. DESIGN: Retrospective single center cohort study of adult surgical patients admitted between 2000 and 2010. PATIENTS: 50,318 adult patients undergoing major surgery. MEASUREMENTS: We evaluated the performance of logistic regression, generalized additive models, naïve Bayes and support vector machines for forecasting postoperative sepsis and acute kidney injury. We assessed the impact of feature reduction techniques on predictive performance. Model performance was determined using the area under the receiver operating characteristic curve, accuracy, and positive predicted value. The results were reported based on a 70/30 cross validation procedure where the data were randomly split into 70% used for training the model and the 30% for validation. MAIN RESULTS: The areas under the receiver operating characteristic curve for different models ranged between 0.797 and 0.858 for acute kidney injury and between 0.757 and 0.909 for severe sepsis. Logistic regression, generalized additive model, and support vector machines had better performance compared to Naïve Bayes model. Generalized additive models additionally accounted for non-linearity of continuous clinical variables as depicted in their risk patterns plots. Reducing the input feature space with LASSO had minimal effect on prediction performance, while feature extraction using principal component analysis improved performance of the models. CONCLUSIONS: Generalized additive models and support vector machines had good performance as risk prediction model for postoperative sepsis and AKI. Feature extraction using principal component analysis improved the predictive performance of all models.
format Online
Article
Text
id pubmed-4883761
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-48837612016-06-10 Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications Thottakkara, Paul Ozrazgat-Baslanti, Tezcan Hupf, Bradley B. Rashidi, Parisa Pardalos, Panos Momcilovic, Petar Bihorac, Azra PLoS One Research Article OBJECTIVE: To compare performance of risk prediction models for forecasting postoperative sepsis and acute kidney injury. DESIGN: Retrospective single center cohort study of adult surgical patients admitted between 2000 and 2010. PATIENTS: 50,318 adult patients undergoing major surgery. MEASUREMENTS: We evaluated the performance of logistic regression, generalized additive models, naïve Bayes and support vector machines for forecasting postoperative sepsis and acute kidney injury. We assessed the impact of feature reduction techniques on predictive performance. Model performance was determined using the area under the receiver operating characteristic curve, accuracy, and positive predicted value. The results were reported based on a 70/30 cross validation procedure where the data were randomly split into 70% used for training the model and the 30% for validation. MAIN RESULTS: The areas under the receiver operating characteristic curve for different models ranged between 0.797 and 0.858 for acute kidney injury and between 0.757 and 0.909 for severe sepsis. Logistic regression, generalized additive model, and support vector machines had better performance compared to Naïve Bayes model. Generalized additive models additionally accounted for non-linearity of continuous clinical variables as depicted in their risk patterns plots. Reducing the input feature space with LASSO had minimal effect on prediction performance, while feature extraction using principal component analysis improved performance of the models. CONCLUSIONS: Generalized additive models and support vector machines had good performance as risk prediction model for postoperative sepsis and AKI. Feature extraction using principal component analysis improved the predictive performance of all models. Public Library of Science 2016-05-27 /pmc/articles/PMC4883761/ /pubmed/27232332 http://dx.doi.org/10.1371/journal.pone.0155705 Text en © 2016 Thottakkara et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Thottakkara, Paul
Ozrazgat-Baslanti, Tezcan
Hupf, Bradley B.
Rashidi, Parisa
Pardalos, Panos
Momcilovic, Petar
Bihorac, Azra
Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications
title Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications
title_full Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications
title_fullStr Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications
title_full_unstemmed Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications
title_short Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications
title_sort application of machine learning techniques to high-dimensional clinical data to forecast postoperative complications
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4883761/
https://www.ncbi.nlm.nih.gov/pubmed/27232332
http://dx.doi.org/10.1371/journal.pone.0155705
work_keys_str_mv AT thottakkarapaul applicationofmachinelearningtechniquestohighdimensionalclinicaldatatoforecastpostoperativecomplications
AT ozrazgatbaslantitezcan applicationofmachinelearningtechniquestohighdimensionalclinicaldatatoforecastpostoperativecomplications
AT hupfbradleyb applicationofmachinelearningtechniquestohighdimensionalclinicaldatatoforecastpostoperativecomplications
AT rashidiparisa applicationofmachinelearningtechniquestohighdimensionalclinicaldatatoforecastpostoperativecomplications
AT pardalospanos applicationofmachinelearningtechniquestohighdimensionalclinicaldatatoforecastpostoperativecomplications
AT momcilovicpetar applicationofmachinelearningtechniquestohighdimensionalclinicaldatatoforecastpostoperativecomplications
AT bihoracazra applicationofmachinelearningtechniquestohighdimensionalclinicaldatatoforecastpostoperativecomplications