Cargando…

Integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury

Manuscripts that have successfully used machine learning (ML) to predict a variety of perioperative outcomes often use only a limited number of features selected by a clinician. We hypothesized that techniques leveraging a broad set of features for patient laboratory results, medications, and the su...

Descripción completa

Detalles Bibliográficos
Autores principales: Hofer, Ira S., Kupina, Marina, Laddaran, Lori, Halperin, Eran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205878/
https://www.ncbi.nlm.nih.gov/pubmed/35715454
http://dx.doi.org/10.1038/s41598-022-13879-7
_version_ 1784729222309216256
author Hofer, Ira S.
Kupina, Marina
Laddaran, Lori
Halperin, Eran
author_facet Hofer, Ira S.
Kupina, Marina
Laddaran, Lori
Halperin, Eran
author_sort Hofer, Ira S.
collection PubMed
description Manuscripts that have successfully used machine learning (ML) to predict a variety of perioperative outcomes often use only a limited number of features selected by a clinician. We hypothesized that techniques leveraging a broad set of features for patient laboratory results, medications, and the surgical procedure name would improve performance as compared to a more limited set of features chosen by clinicians. Feature vectors for laboratory results included 702 features total derived from 39 laboratory tests, medications consisted of a binary flag for 126 commonly used medications, procedure name used the Word2Vec package for create a vector of length 100. Nine models were trained: baseline features, one for each of the three types of data Baseline + Each data type, (all features, and then all features with feature reduction algorithm. Across both outcomes the models that contained all features (model 8) (Mortality ROC-AUC 94.32 ± 1.01, PR-AUC 36.80 ± 5.10 AKI ROC-AUC 92.45 ± 0.64, PR-AUC 76.22 ± 1.95) was superior to models with only subsets of features. Featurization techniques leveraging a broad away of clinical data can improve performance of perioperative prediction models.
format Online
Article
Text
id pubmed-9205878
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-92058782022-06-19 Integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury Hofer, Ira S. Kupina, Marina Laddaran, Lori Halperin, Eran Sci Rep Article Manuscripts that have successfully used machine learning (ML) to predict a variety of perioperative outcomes often use only a limited number of features selected by a clinician. We hypothesized that techniques leveraging a broad set of features for patient laboratory results, medications, and the surgical procedure name would improve performance as compared to a more limited set of features chosen by clinicians. Feature vectors for laboratory results included 702 features total derived from 39 laboratory tests, medications consisted of a binary flag for 126 commonly used medications, procedure name used the Word2Vec package for create a vector of length 100. Nine models were trained: baseline features, one for each of the three types of data Baseline + Each data type, (all features, and then all features with feature reduction algorithm. Across both outcomes the models that contained all features (model 8) (Mortality ROC-AUC 94.32 ± 1.01, PR-AUC 36.80 ± 5.10 AKI ROC-AUC 92.45 ± 0.64, PR-AUC 76.22 ± 1.95) was superior to models with only subsets of features. Featurization techniques leveraging a broad away of clinical data can improve performance of perioperative prediction models. Nature Publishing Group UK 2022-06-17 /pmc/articles/PMC9205878/ /pubmed/35715454 http://dx.doi.org/10.1038/s41598-022-13879-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Hofer, Ira S.
Kupina, Marina
Laddaran, Lori
Halperin, Eran
Integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury
title Integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury
title_full Integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury
title_fullStr Integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury
title_full_unstemmed Integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury
title_short Integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury
title_sort integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205878/
https://www.ncbi.nlm.nih.gov/pubmed/35715454
http://dx.doi.org/10.1038/s41598-022-13879-7
work_keys_str_mv AT hoferiras integrationoffeaturevectorsfromrawlaboratorymedicationandprocedurenamesimprovestheprecisionandrecallofmodelstopredictpostoperativemortalityandacutekidneyinjury
AT kupinamarina integrationoffeaturevectorsfromrawlaboratorymedicationandprocedurenamesimprovestheprecisionandrecallofmodelstopredictpostoperativemortalityandacutekidneyinjury
AT laddaranlori integrationoffeaturevectorsfromrawlaboratorymedicationandprocedurenamesimprovestheprecisionandrecallofmodelstopredictpostoperativemortalityandacutekidneyinjury
AT halperineran integrationoffeaturevectorsfromrawlaboratorymedicationandprocedurenamesimprovestheprecisionandrecallofmodelstopredictpostoperativemortalityandacutekidneyinjury