Cargando…
Maximizing Interpretability and Cost-Effectiveness of Surgical Site Infection (SSI) Predictive Models Using Feature-Specific Regularized Logistic Regression on Preoperative Temporal Data
This study describes a novel approach to solve the surgical site infection (SSI) classification problem. Feature engineering has traditionally been one of the most important steps in solving complex classification problems, especially in cases with temporal data. The described novel approach is base...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6399553/ https://www.ncbi.nlm.nih.gov/pubmed/30915154 http://dx.doi.org/10.1155/2019/2059851 |
_version_ | 1783399782788628480 |
---|---|
author | Kocbek, Primoz Fijacko, Nino Soguero-Ruiz, Cristina Mikalsen, Karl Øyvind Maver, Uros Povalej Brzan, Petra Stozer, Andraz Jenssen, Robert Skrøvseth, Stein Olav Stiglic, Gregor |
author_facet | Kocbek, Primoz Fijacko, Nino Soguero-Ruiz, Cristina Mikalsen, Karl Øyvind Maver, Uros Povalej Brzan, Petra Stozer, Andraz Jenssen, Robert Skrøvseth, Stein Olav Stiglic, Gregor |
author_sort | Kocbek, Primoz |
collection | PubMed |
description | This study describes a novel approach to solve the surgical site infection (SSI) classification problem. Feature engineering has traditionally been one of the most important steps in solving complex classification problems, especially in cases with temporal data. The described novel approach is based on abstraction of temporal data recorded in three temporal windows. Maximum likelihood L1-norm (lasso) regularization was used in penalized logistic regression to predict the onset of surgical site infection occurrence based on available patient blood testing results up to the day of surgery. Prior knowledge of predictors (blood tests) was integrated in the modelling by introduction of penalty factors depending on blood test prices and an early stopping parameter limiting the maximum number of selected features used in predictive modelling. Finally, solutions resulting in higher interpretability and cost-effectiveness were demonstrated. Using repeated holdout cross-validation, the baseline C-reactive protein (CRP) classifier achieved a mean AUC of 0.801, whereas our best full lasso model achieved a mean AUC of 0.956. Best model testing results were achieved for full lasso model with maximum number of features limited at 20 features with an AUC of 0.967. Presented models showed the potential to not only support domain experts in their decision making but could also prove invaluable for improvement in prediction of SSI occurrence, which may even help setting new guidelines in the field of preoperative SSI prevention and surveillance. |
format | Online Article Text |
id | pubmed-6399553 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-63995532019-03-26 Maximizing Interpretability and Cost-Effectiveness of Surgical Site Infection (SSI) Predictive Models Using Feature-Specific Regularized Logistic Regression on Preoperative Temporal Data Kocbek, Primoz Fijacko, Nino Soguero-Ruiz, Cristina Mikalsen, Karl Øyvind Maver, Uros Povalej Brzan, Petra Stozer, Andraz Jenssen, Robert Skrøvseth, Stein Olav Stiglic, Gregor Comput Math Methods Med Research Article This study describes a novel approach to solve the surgical site infection (SSI) classification problem. Feature engineering has traditionally been one of the most important steps in solving complex classification problems, especially in cases with temporal data. The described novel approach is based on abstraction of temporal data recorded in three temporal windows. Maximum likelihood L1-norm (lasso) regularization was used in penalized logistic regression to predict the onset of surgical site infection occurrence based on available patient blood testing results up to the day of surgery. Prior knowledge of predictors (blood tests) was integrated in the modelling by introduction of penalty factors depending on blood test prices and an early stopping parameter limiting the maximum number of selected features used in predictive modelling. Finally, solutions resulting in higher interpretability and cost-effectiveness were demonstrated. Using repeated holdout cross-validation, the baseline C-reactive protein (CRP) classifier achieved a mean AUC of 0.801, whereas our best full lasso model achieved a mean AUC of 0.956. Best model testing results were achieved for full lasso model with maximum number of features limited at 20 features with an AUC of 0.967. Presented models showed the potential to not only support domain experts in their decision making but could also prove invaluable for improvement in prediction of SSI occurrence, which may even help setting new guidelines in the field of preoperative SSI prevention and surveillance. Hindawi 2019-02-19 /pmc/articles/PMC6399553/ /pubmed/30915154 http://dx.doi.org/10.1155/2019/2059851 Text en Copyright © 2019 Primoz Kocbek et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Kocbek, Primoz Fijacko, Nino Soguero-Ruiz, Cristina Mikalsen, Karl Øyvind Maver, Uros Povalej Brzan, Petra Stozer, Andraz Jenssen, Robert Skrøvseth, Stein Olav Stiglic, Gregor Maximizing Interpretability and Cost-Effectiveness of Surgical Site Infection (SSI) Predictive Models Using Feature-Specific Regularized Logistic Regression on Preoperative Temporal Data |
title | Maximizing Interpretability and Cost-Effectiveness of Surgical Site Infection (SSI) Predictive Models Using Feature-Specific Regularized Logistic Regression on Preoperative Temporal Data |
title_full | Maximizing Interpretability and Cost-Effectiveness of Surgical Site Infection (SSI) Predictive Models Using Feature-Specific Regularized Logistic Regression on Preoperative Temporal Data |
title_fullStr | Maximizing Interpretability and Cost-Effectiveness of Surgical Site Infection (SSI) Predictive Models Using Feature-Specific Regularized Logistic Regression on Preoperative Temporal Data |
title_full_unstemmed | Maximizing Interpretability and Cost-Effectiveness of Surgical Site Infection (SSI) Predictive Models Using Feature-Specific Regularized Logistic Regression on Preoperative Temporal Data |
title_short | Maximizing Interpretability and Cost-Effectiveness of Surgical Site Infection (SSI) Predictive Models Using Feature-Specific Regularized Logistic Regression on Preoperative Temporal Data |
title_sort | maximizing interpretability and cost-effectiveness of surgical site infection (ssi) predictive models using feature-specific regularized logistic regression on preoperative temporal data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6399553/ https://www.ncbi.nlm.nih.gov/pubmed/30915154 http://dx.doi.org/10.1155/2019/2059851 |
work_keys_str_mv | AT kocbekprimoz maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata AT fijackonino maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata AT sogueroruizcristina maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata AT mikalsenkarløyvind maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata AT maveruros maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata AT povalejbrzanpetra maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata AT stozerandraz maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata AT jenssenrobert maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata AT skrøvsethsteinolav maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata AT stiglicgregor maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata |