Cargando…

Predicting the occurrence of surgical site infections using text mining and machine learning

In this study we propose the use of text mining and machine learning methods to predict and detect Surgical Site Infections (SSIs) using textual descriptions of surgeries and post-operative patients’ records, mined from the database of a high complexity University hospital. SSIs are among the most c...

Descripción completa

Detalles Bibliográficos
Autores principales: da Silva, Daniel A., ten Caten, Carla S., dos Santos, Rodrigo P., Fogliatto, Flavio S., Hsuan, Juliana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6910696/
https://www.ncbi.nlm.nih.gov/pubmed/31834905
http://dx.doi.org/10.1371/journal.pone.0226272
_version_ 1783479138722512896
author da Silva, Daniel A.
ten Caten, Carla S.
dos Santos, Rodrigo P.
Fogliatto, Flavio S.
Hsuan, Juliana
author_facet da Silva, Daniel A.
ten Caten, Carla S.
dos Santos, Rodrigo P.
Fogliatto, Flavio S.
Hsuan, Juliana
author_sort da Silva, Daniel A.
collection PubMed
description In this study we propose the use of text mining and machine learning methods to predict and detect Surgical Site Infections (SSIs) using textual descriptions of surgeries and post-operative patients’ records, mined from the database of a high complexity University hospital. SSIs are among the most common adverse events experienced by hospitalized patients; preventing such events is fundamental to ensure patients’ safety. Knowledge on SSI occurrence rates may also be useful in preventing future episodes. We analyzed 15,479 surgery descriptions and post-operative records testing different preprocessing strategies and the following machine learning algorithms: Linear SVC, Logistic Regression, Multinomial Naive Bayes, Nearest Centroid, Random Forest, Stochastic Gradient Descent, and Support Vector Classification (SVC). For prediction purposes, the best result was obtained using the Stochastic Gradient Descent method (79.7% ROC-AUC); for detection, Logistic Regression yielded the best performance (80.6% ROC-AUC).
format Online
Article
Text
id pubmed-6910696
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-69106962019-12-27 Predicting the occurrence of surgical site infections using text mining and machine learning da Silva, Daniel A. ten Caten, Carla S. dos Santos, Rodrigo P. Fogliatto, Flavio S. Hsuan, Juliana PLoS One Research Article In this study we propose the use of text mining and machine learning methods to predict and detect Surgical Site Infections (SSIs) using textual descriptions of surgeries and post-operative patients’ records, mined from the database of a high complexity University hospital. SSIs are among the most common adverse events experienced by hospitalized patients; preventing such events is fundamental to ensure patients’ safety. Knowledge on SSI occurrence rates may also be useful in preventing future episodes. We analyzed 15,479 surgery descriptions and post-operative records testing different preprocessing strategies and the following machine learning algorithms: Linear SVC, Logistic Regression, Multinomial Naive Bayes, Nearest Centroid, Random Forest, Stochastic Gradient Descent, and Support Vector Classification (SVC). For prediction purposes, the best result was obtained using the Stochastic Gradient Descent method (79.7% ROC-AUC); for detection, Logistic Regression yielded the best performance (80.6% ROC-AUC). Public Library of Science 2019-12-13 /pmc/articles/PMC6910696/ /pubmed/31834905 http://dx.doi.org/10.1371/journal.pone.0226272 Text en © 2019 da Silva et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
da Silva, Daniel A.
ten Caten, Carla S.
dos Santos, Rodrigo P.
Fogliatto, Flavio S.
Hsuan, Juliana
Predicting the occurrence of surgical site infections using text mining and machine learning
title Predicting the occurrence of surgical site infections using text mining and machine learning
title_full Predicting the occurrence of surgical site infections using text mining and machine learning
title_fullStr Predicting the occurrence of surgical site infections using text mining and machine learning
title_full_unstemmed Predicting the occurrence of surgical site infections using text mining and machine learning
title_short Predicting the occurrence of surgical site infections using text mining and machine learning
title_sort predicting the occurrence of surgical site infections using text mining and machine learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6910696/
https://www.ncbi.nlm.nih.gov/pubmed/31834905
http://dx.doi.org/10.1371/journal.pone.0226272
work_keys_str_mv AT dasilvadaniela predictingtheoccurrenceofsurgicalsiteinfectionsusingtextminingandmachinelearning
AT tencatencarlas predictingtheoccurrenceofsurgicalsiteinfectionsusingtextminingandmachinelearning
AT dossantosrodrigop predictingtheoccurrenceofsurgicalsiteinfectionsusingtextminingandmachinelearning
AT fogliattoflavios predictingtheoccurrenceofsurgicalsiteinfectionsusingtextminingandmachinelearning
AT hsuanjuliana predictingtheoccurrenceofsurgicalsiteinfectionsusingtextminingandmachinelearning