Cargando…

Maximizing Interpretability and Cost-Effectiveness of Surgical Site Infection (SSI) Predictive Models Using Feature-Specific Regularized Logistic Regression on Preoperative Temporal Data

This study describes a novel approach to solve the surgical site infection (SSI) classification problem. Feature engineering has traditionally been one of the most important steps in solving complex classification problems, especially in cases with temporal data. The described novel approach is base...

Descripción completa

Detalles Bibliográficos
Autores principales: Kocbek, Primoz, Fijacko, Nino, Soguero-Ruiz, Cristina, Mikalsen, Karl Øyvind, Maver, Uros, Povalej Brzan, Petra, Stozer, Andraz, Jenssen, Robert, Skrøvseth, Stein Olav, Stiglic, Gregor
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6399553/
https://www.ncbi.nlm.nih.gov/pubmed/30915154
http://dx.doi.org/10.1155/2019/2059851
_version_ 1783399782788628480
author Kocbek, Primoz
Fijacko, Nino
Soguero-Ruiz, Cristina
Mikalsen, Karl Øyvind
Maver, Uros
Povalej Brzan, Petra
Stozer, Andraz
Jenssen, Robert
Skrøvseth, Stein Olav
Stiglic, Gregor
author_facet Kocbek, Primoz
Fijacko, Nino
Soguero-Ruiz, Cristina
Mikalsen, Karl Øyvind
Maver, Uros
Povalej Brzan, Petra
Stozer, Andraz
Jenssen, Robert
Skrøvseth, Stein Olav
Stiglic, Gregor
author_sort Kocbek, Primoz
collection PubMed
description This study describes a novel approach to solve the surgical site infection (SSI) classification problem. Feature engineering has traditionally been one of the most important steps in solving complex classification problems, especially in cases with temporal data. The described novel approach is based on abstraction of temporal data recorded in three temporal windows. Maximum likelihood L1-norm (lasso) regularization was used in penalized logistic regression to predict the onset of surgical site infection occurrence based on available patient blood testing results up to the day of surgery. Prior knowledge of predictors (blood tests) was integrated in the modelling by introduction of penalty factors depending on blood test prices and an early stopping parameter limiting the maximum number of selected features used in predictive modelling. Finally, solutions resulting in higher interpretability and cost-effectiveness were demonstrated. Using repeated holdout cross-validation, the baseline C-reactive protein (CRP) classifier achieved a mean AUC of 0.801, whereas our best full lasso model achieved a mean AUC of 0.956. Best model testing results were achieved for full lasso model with maximum number of features limited at 20 features with an AUC of 0.967. Presented models showed the potential to not only support domain experts in their decision making but could also prove invaluable for improvement in prediction of SSI occurrence, which may even help setting new guidelines in the field of preoperative SSI prevention and surveillance.
format Online
Article
Text
id pubmed-6399553
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-63995532019-03-26 Maximizing Interpretability and Cost-Effectiveness of Surgical Site Infection (SSI) Predictive Models Using Feature-Specific Regularized Logistic Regression on Preoperative Temporal Data Kocbek, Primoz Fijacko, Nino Soguero-Ruiz, Cristina Mikalsen, Karl Øyvind Maver, Uros Povalej Brzan, Petra Stozer, Andraz Jenssen, Robert Skrøvseth, Stein Olav Stiglic, Gregor Comput Math Methods Med Research Article This study describes a novel approach to solve the surgical site infection (SSI) classification problem. Feature engineering has traditionally been one of the most important steps in solving complex classification problems, especially in cases with temporal data. The described novel approach is based on abstraction of temporal data recorded in three temporal windows. Maximum likelihood L1-norm (lasso) regularization was used in penalized logistic regression to predict the onset of surgical site infection occurrence based on available patient blood testing results up to the day of surgery. Prior knowledge of predictors (blood tests) was integrated in the modelling by introduction of penalty factors depending on blood test prices and an early stopping parameter limiting the maximum number of selected features used in predictive modelling. Finally, solutions resulting in higher interpretability and cost-effectiveness were demonstrated. Using repeated holdout cross-validation, the baseline C-reactive protein (CRP) classifier achieved a mean AUC of 0.801, whereas our best full lasso model achieved a mean AUC of 0.956. Best model testing results were achieved for full lasso model with maximum number of features limited at 20 features with an AUC of 0.967. Presented models showed the potential to not only support domain experts in their decision making but could also prove invaluable for improvement in prediction of SSI occurrence, which may even help setting new guidelines in the field of preoperative SSI prevention and surveillance. Hindawi 2019-02-19 /pmc/articles/PMC6399553/ /pubmed/30915154 http://dx.doi.org/10.1155/2019/2059851 Text en Copyright © 2019 Primoz Kocbek et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Kocbek, Primoz
Fijacko, Nino
Soguero-Ruiz, Cristina
Mikalsen, Karl Øyvind
Maver, Uros
Povalej Brzan, Petra
Stozer, Andraz
Jenssen, Robert
Skrøvseth, Stein Olav
Stiglic, Gregor
Maximizing Interpretability and Cost-Effectiveness of Surgical Site Infection (SSI) Predictive Models Using Feature-Specific Regularized Logistic Regression on Preoperative Temporal Data
title Maximizing Interpretability and Cost-Effectiveness of Surgical Site Infection (SSI) Predictive Models Using Feature-Specific Regularized Logistic Regression on Preoperative Temporal Data
title_full Maximizing Interpretability and Cost-Effectiveness of Surgical Site Infection (SSI) Predictive Models Using Feature-Specific Regularized Logistic Regression on Preoperative Temporal Data
title_fullStr Maximizing Interpretability and Cost-Effectiveness of Surgical Site Infection (SSI) Predictive Models Using Feature-Specific Regularized Logistic Regression on Preoperative Temporal Data
title_full_unstemmed Maximizing Interpretability and Cost-Effectiveness of Surgical Site Infection (SSI) Predictive Models Using Feature-Specific Regularized Logistic Regression on Preoperative Temporal Data
title_short Maximizing Interpretability and Cost-Effectiveness of Surgical Site Infection (SSI) Predictive Models Using Feature-Specific Regularized Logistic Regression on Preoperative Temporal Data
title_sort maximizing interpretability and cost-effectiveness of surgical site infection (ssi) predictive models using feature-specific regularized logistic regression on preoperative temporal data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6399553/
https://www.ncbi.nlm.nih.gov/pubmed/30915154
http://dx.doi.org/10.1155/2019/2059851
work_keys_str_mv AT kocbekprimoz maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata
AT fijackonino maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata
AT sogueroruizcristina maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata
AT mikalsenkarløyvind maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata
AT maveruros maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata
AT povalejbrzanpetra maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata
AT stozerandraz maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata
AT jenssenrobert maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata
AT skrøvsethsteinolav maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata
AT stiglicgregor maximizinginterpretabilityandcosteffectivenessofsurgicalsiteinfectionssipredictivemodelsusingfeaturespecificregularizedlogisticregressiononpreoperativetemporaldata