Cargando…

Development of machine learning models for the detection of surgical site infections following total hip and knee arthroplasty: a multicenter cohort study

BACKGROUND: Population based surveillance of surgical site infections (SSIs) requires precise case-finding strategies. We sought to develop and validate machine learning models to automate the process of complex (deep incisional/organ space) SSIs case detection. METHODS: This retrospective cohort st...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Guosong, Cheligeer, Cheligeer, Southern, Danielle A., Martin, Elliot A., Xu, Yuan, Leal, Jenine, Ellison, Jennifer, Bush, Kathryn, Williamson, Tyler, Quan, Hude, Eastwood, Cathy A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10474760/
https://www.ncbi.nlm.nih.gov/pubmed/37658409
http://dx.doi.org/10.1186/s13756-023-01294-0
_version_ 1785100572420997120
author Wu, Guosong
Cheligeer, Cheligeer
Southern, Danielle A.
Martin, Elliot A.
Xu, Yuan
Leal, Jenine
Ellison, Jennifer
Bush, Kathryn
Williamson, Tyler
Quan, Hude
Eastwood, Cathy A.
author_facet Wu, Guosong
Cheligeer, Cheligeer
Southern, Danielle A.
Martin, Elliot A.
Xu, Yuan
Leal, Jenine
Ellison, Jennifer
Bush, Kathryn
Williamson, Tyler
Quan, Hude
Eastwood, Cathy A.
author_sort Wu, Guosong
collection PubMed
description BACKGROUND: Population based surveillance of surgical site infections (SSIs) requires precise case-finding strategies. We sought to develop and validate machine learning models to automate the process of complex (deep incisional/organ space) SSIs case detection. METHODS: This retrospective cohort study included adult patients (age ≥ 18 years) admitted to Calgary, Canada acute care hospitals who underwent primary total elective hip (THA) or knee (TKA) arthroplasty between Jan 1st, 2013 and Aug 31st, 2020. True SSI conditions were judged by the Alberta Health Services Infection Prevention and Control (IPC) program staff. Using the IPC cases as labels, we developed and validated nine XGBoost models to identify deep incisional SSIs, organ space SSIs and complex SSIs using administrative data, electronic medical records (EMR) free text data, and both. The performance of machine learning models was assessed by sensitivity, specificity, positive predictive value, negative predictive value, F1 score, the area under the receiver operating characteristic curve (ROC AUC) and the area under the precision–recall curve (PR AUC). In addition, a bootstrap 95% confidence interval (95% CI) was calculated. RESULTS: There were 22,059 unique patients with 27,360 hospital admissions resulting in 88,351 days of hospital stay. This included 16,561 (60.5%) TKA and 10,799 (39.5%) THA procedures. There were 235 ascertained SSIs. Of them, 77 (32.8%) were superficial incisional SSIs, 57 (24.3%) were deep incisional SSIs, and 101 (42.9%) were organ space SSIs. The incidence rates were 0.37 for superficial incisional SSIs, 0.21 for deep incisional SSIs, 0.37 for organ space and 0.58 for complex SSIs per 100 surgical procedures, respectively. The optimal XGBoost models using administrative data and text data combined achieved a ROC AUC of 0.906 (95% CI 0.835–0.978), PR AUC of 0.637 (95% CI 0.528–0.746), and F1 score of 0.79 (0.67–0.90). CONCLUSIONS: Our findings suggest machine learning models derived from administrative data and EMR text data achieved high performance and can be used to automate the detection of complex SSIs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13756-023-01294-0.
format Online
Article
Text
id pubmed-10474760
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-104747602023-09-03 Development of machine learning models for the detection of surgical site infections following total hip and knee arthroplasty: a multicenter cohort study Wu, Guosong Cheligeer, Cheligeer Southern, Danielle A. Martin, Elliot A. Xu, Yuan Leal, Jenine Ellison, Jennifer Bush, Kathryn Williamson, Tyler Quan, Hude Eastwood, Cathy A. Antimicrob Resist Infect Control Research BACKGROUND: Population based surveillance of surgical site infections (SSIs) requires precise case-finding strategies. We sought to develop and validate machine learning models to automate the process of complex (deep incisional/organ space) SSIs case detection. METHODS: This retrospective cohort study included adult patients (age ≥ 18 years) admitted to Calgary, Canada acute care hospitals who underwent primary total elective hip (THA) or knee (TKA) arthroplasty between Jan 1st, 2013 and Aug 31st, 2020. True SSI conditions were judged by the Alberta Health Services Infection Prevention and Control (IPC) program staff. Using the IPC cases as labels, we developed and validated nine XGBoost models to identify deep incisional SSIs, organ space SSIs and complex SSIs using administrative data, electronic medical records (EMR) free text data, and both. The performance of machine learning models was assessed by sensitivity, specificity, positive predictive value, negative predictive value, F1 score, the area under the receiver operating characteristic curve (ROC AUC) and the area under the precision–recall curve (PR AUC). In addition, a bootstrap 95% confidence interval (95% CI) was calculated. RESULTS: There were 22,059 unique patients with 27,360 hospital admissions resulting in 88,351 days of hospital stay. This included 16,561 (60.5%) TKA and 10,799 (39.5%) THA procedures. There were 235 ascertained SSIs. Of them, 77 (32.8%) were superficial incisional SSIs, 57 (24.3%) were deep incisional SSIs, and 101 (42.9%) were organ space SSIs. The incidence rates were 0.37 for superficial incisional SSIs, 0.21 for deep incisional SSIs, 0.37 for organ space and 0.58 for complex SSIs per 100 surgical procedures, respectively. The optimal XGBoost models using administrative data and text data combined achieved a ROC AUC of 0.906 (95% CI 0.835–0.978), PR AUC of 0.637 (95% CI 0.528–0.746), and F1 score of 0.79 (0.67–0.90). CONCLUSIONS: Our findings suggest machine learning models derived from administrative data and EMR text data achieved high performance and can be used to automate the detection of complex SSIs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13756-023-01294-0. BioMed Central 2023-09-02 /pmc/articles/PMC10474760/ /pubmed/37658409 http://dx.doi.org/10.1186/s13756-023-01294-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Wu, Guosong
Cheligeer, Cheligeer
Southern, Danielle A.
Martin, Elliot A.
Xu, Yuan
Leal, Jenine
Ellison, Jennifer
Bush, Kathryn
Williamson, Tyler
Quan, Hude
Eastwood, Cathy A.
Development of machine learning models for the detection of surgical site infections following total hip and knee arthroplasty: a multicenter cohort study
title Development of machine learning models for the detection of surgical site infections following total hip and knee arthroplasty: a multicenter cohort study
title_full Development of machine learning models for the detection of surgical site infections following total hip and knee arthroplasty: a multicenter cohort study
title_fullStr Development of machine learning models for the detection of surgical site infections following total hip and knee arthroplasty: a multicenter cohort study
title_full_unstemmed Development of machine learning models for the detection of surgical site infections following total hip and knee arthroplasty: a multicenter cohort study
title_short Development of machine learning models for the detection of surgical site infections following total hip and knee arthroplasty: a multicenter cohort study
title_sort development of machine learning models for the detection of surgical site infections following total hip and knee arthroplasty: a multicenter cohort study
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10474760/
https://www.ncbi.nlm.nih.gov/pubmed/37658409
http://dx.doi.org/10.1186/s13756-023-01294-0
work_keys_str_mv AT wuguosong developmentofmachinelearningmodelsforthedetectionofsurgicalsiteinfectionsfollowingtotalhipandkneearthroplastyamulticentercohortstudy
AT cheligeercheligeer developmentofmachinelearningmodelsforthedetectionofsurgicalsiteinfectionsfollowingtotalhipandkneearthroplastyamulticentercohortstudy
AT southerndaniellea developmentofmachinelearningmodelsforthedetectionofsurgicalsiteinfectionsfollowingtotalhipandkneearthroplastyamulticentercohortstudy
AT martinelliota developmentofmachinelearningmodelsforthedetectionofsurgicalsiteinfectionsfollowingtotalhipandkneearthroplastyamulticentercohortstudy
AT xuyuan developmentofmachinelearningmodelsforthedetectionofsurgicalsiteinfectionsfollowingtotalhipandkneearthroplastyamulticentercohortstudy
AT lealjenine developmentofmachinelearningmodelsforthedetectionofsurgicalsiteinfectionsfollowingtotalhipandkneearthroplastyamulticentercohortstudy
AT ellisonjennifer developmentofmachinelearningmodelsforthedetectionofsurgicalsiteinfectionsfollowingtotalhipandkneearthroplastyamulticentercohortstudy
AT bushkathryn developmentofmachinelearningmodelsforthedetectionofsurgicalsiteinfectionsfollowingtotalhipandkneearthroplastyamulticentercohortstudy
AT williamsontyler developmentofmachinelearningmodelsforthedetectionofsurgicalsiteinfectionsfollowingtotalhipandkneearthroplastyamulticentercohortstudy
AT quanhude developmentofmachinelearningmodelsforthedetectionofsurgicalsiteinfectionsfollowingtotalhipandkneearthroplastyamulticentercohortstudy
AT eastwoodcathya developmentofmachinelearningmodelsforthedetectionofsurgicalsiteinfectionsfollowingtotalhipandkneearthroplastyamulticentercohortstudy