Cargando…
Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications
OBJECTIVE: To compare performance of risk prediction models for forecasting postoperative sepsis and acute kidney injury. DESIGN: Retrospective single center cohort study of adult surgical patients admitted between 2000 and 2010. PATIENTS: 50,318 adult patients undergoing major surgery. MEASUREMENTS...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4883761/ https://www.ncbi.nlm.nih.gov/pubmed/27232332 http://dx.doi.org/10.1371/journal.pone.0155705 |
_version_ | 1782434298710720512 |
---|---|
author | Thottakkara, Paul Ozrazgat-Baslanti, Tezcan Hupf, Bradley B. Rashidi, Parisa Pardalos, Panos Momcilovic, Petar Bihorac, Azra |
author_facet | Thottakkara, Paul Ozrazgat-Baslanti, Tezcan Hupf, Bradley B. Rashidi, Parisa Pardalos, Panos Momcilovic, Petar Bihorac, Azra |
author_sort | Thottakkara, Paul |
collection | PubMed |
description | OBJECTIVE: To compare performance of risk prediction models for forecasting postoperative sepsis and acute kidney injury. DESIGN: Retrospective single center cohort study of adult surgical patients admitted between 2000 and 2010. PATIENTS: 50,318 adult patients undergoing major surgery. MEASUREMENTS: We evaluated the performance of logistic regression, generalized additive models, naïve Bayes and support vector machines for forecasting postoperative sepsis and acute kidney injury. We assessed the impact of feature reduction techniques on predictive performance. Model performance was determined using the area under the receiver operating characteristic curve, accuracy, and positive predicted value. The results were reported based on a 70/30 cross validation procedure where the data were randomly split into 70% used for training the model and the 30% for validation. MAIN RESULTS: The areas under the receiver operating characteristic curve for different models ranged between 0.797 and 0.858 for acute kidney injury and between 0.757 and 0.909 for severe sepsis. Logistic regression, generalized additive model, and support vector machines had better performance compared to Naïve Bayes model. Generalized additive models additionally accounted for non-linearity of continuous clinical variables as depicted in their risk patterns plots. Reducing the input feature space with LASSO had minimal effect on prediction performance, while feature extraction using principal component analysis improved performance of the models. CONCLUSIONS: Generalized additive models and support vector machines had good performance as risk prediction model for postoperative sepsis and AKI. Feature extraction using principal component analysis improved the predictive performance of all models. |
format | Online Article Text |
id | pubmed-4883761 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-48837612016-06-10 Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications Thottakkara, Paul Ozrazgat-Baslanti, Tezcan Hupf, Bradley B. Rashidi, Parisa Pardalos, Panos Momcilovic, Petar Bihorac, Azra PLoS One Research Article OBJECTIVE: To compare performance of risk prediction models for forecasting postoperative sepsis and acute kidney injury. DESIGN: Retrospective single center cohort study of adult surgical patients admitted between 2000 and 2010. PATIENTS: 50,318 adult patients undergoing major surgery. MEASUREMENTS: We evaluated the performance of logistic regression, generalized additive models, naïve Bayes and support vector machines for forecasting postoperative sepsis and acute kidney injury. We assessed the impact of feature reduction techniques on predictive performance. Model performance was determined using the area under the receiver operating characteristic curve, accuracy, and positive predicted value. The results were reported based on a 70/30 cross validation procedure where the data were randomly split into 70% used for training the model and the 30% for validation. MAIN RESULTS: The areas under the receiver operating characteristic curve for different models ranged between 0.797 and 0.858 for acute kidney injury and between 0.757 and 0.909 for severe sepsis. Logistic regression, generalized additive model, and support vector machines had better performance compared to Naïve Bayes model. Generalized additive models additionally accounted for non-linearity of continuous clinical variables as depicted in their risk patterns plots. Reducing the input feature space with LASSO had minimal effect on prediction performance, while feature extraction using principal component analysis improved performance of the models. CONCLUSIONS: Generalized additive models and support vector machines had good performance as risk prediction model for postoperative sepsis and AKI. Feature extraction using principal component analysis improved the predictive performance of all models. Public Library of Science 2016-05-27 /pmc/articles/PMC4883761/ /pubmed/27232332 http://dx.doi.org/10.1371/journal.pone.0155705 Text en © 2016 Thottakkara et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Thottakkara, Paul Ozrazgat-Baslanti, Tezcan Hupf, Bradley B. Rashidi, Parisa Pardalos, Panos Momcilovic, Petar Bihorac, Azra Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications |
title | Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications |
title_full | Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications |
title_fullStr | Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications |
title_full_unstemmed | Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications |
title_short | Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications |
title_sort | application of machine learning techniques to high-dimensional clinical data to forecast postoperative complications |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4883761/ https://www.ncbi.nlm.nih.gov/pubmed/27232332 http://dx.doi.org/10.1371/journal.pone.0155705 |
work_keys_str_mv | AT thottakkarapaul applicationofmachinelearningtechniquestohighdimensionalclinicaldatatoforecastpostoperativecomplications AT ozrazgatbaslantitezcan applicationofmachinelearningtechniquestohighdimensionalclinicaldatatoforecastpostoperativecomplications AT hupfbradleyb applicationofmachinelearningtechniquestohighdimensionalclinicaldatatoforecastpostoperativecomplications AT rashidiparisa applicationofmachinelearningtechniquestohighdimensionalclinicaldatatoforecastpostoperativecomplications AT pardalospanos applicationofmachinelearningtechniquestohighdimensionalclinicaldatatoforecastpostoperativecomplications AT momcilovicpetar applicationofmachinelearningtechniquestohighdimensionalclinicaldatatoforecastpostoperativecomplications AT bihoracazra applicationofmachinelearningtechniquestohighdimensionalclinicaldatatoforecastpostoperativecomplications |