Cargando…
Integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury
Manuscripts that have successfully used machine learning (ML) to predict a variety of perioperative outcomes often use only a limited number of features selected by a clinician. We hypothesized that techniques leveraging a broad set of features for patient laboratory results, medications, and the su...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205878/ https://www.ncbi.nlm.nih.gov/pubmed/35715454 http://dx.doi.org/10.1038/s41598-022-13879-7 |
_version_ | 1784729222309216256 |
---|---|
author | Hofer, Ira S. Kupina, Marina Laddaran, Lori Halperin, Eran |
author_facet | Hofer, Ira S. Kupina, Marina Laddaran, Lori Halperin, Eran |
author_sort | Hofer, Ira S. |
collection | PubMed |
description | Manuscripts that have successfully used machine learning (ML) to predict a variety of perioperative outcomes often use only a limited number of features selected by a clinician. We hypothesized that techniques leveraging a broad set of features for patient laboratory results, medications, and the surgical procedure name would improve performance as compared to a more limited set of features chosen by clinicians. Feature vectors for laboratory results included 702 features total derived from 39 laboratory tests, medications consisted of a binary flag for 126 commonly used medications, procedure name used the Word2Vec package for create a vector of length 100. Nine models were trained: baseline features, one for each of the three types of data Baseline + Each data type, (all features, and then all features with feature reduction algorithm. Across both outcomes the models that contained all features (model 8) (Mortality ROC-AUC 94.32 ± 1.01, PR-AUC 36.80 ± 5.10 AKI ROC-AUC 92.45 ± 0.64, PR-AUC 76.22 ± 1.95) was superior to models with only subsets of features. Featurization techniques leveraging a broad away of clinical data can improve performance of perioperative prediction models. |
format | Online Article Text |
id | pubmed-9205878 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-92058782022-06-19 Integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury Hofer, Ira S. Kupina, Marina Laddaran, Lori Halperin, Eran Sci Rep Article Manuscripts that have successfully used machine learning (ML) to predict a variety of perioperative outcomes often use only a limited number of features selected by a clinician. We hypothesized that techniques leveraging a broad set of features for patient laboratory results, medications, and the surgical procedure name would improve performance as compared to a more limited set of features chosen by clinicians. Feature vectors for laboratory results included 702 features total derived from 39 laboratory tests, medications consisted of a binary flag for 126 commonly used medications, procedure name used the Word2Vec package for create a vector of length 100. Nine models were trained: baseline features, one for each of the three types of data Baseline + Each data type, (all features, and then all features with feature reduction algorithm. Across both outcomes the models that contained all features (model 8) (Mortality ROC-AUC 94.32 ± 1.01, PR-AUC 36.80 ± 5.10 AKI ROC-AUC 92.45 ± 0.64, PR-AUC 76.22 ± 1.95) was superior to models with only subsets of features. Featurization techniques leveraging a broad away of clinical data can improve performance of perioperative prediction models. Nature Publishing Group UK 2022-06-17 /pmc/articles/PMC9205878/ /pubmed/35715454 http://dx.doi.org/10.1038/s41598-022-13879-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Hofer, Ira S. Kupina, Marina Laddaran, Lori Halperin, Eran Integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury |
title | Integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury |
title_full | Integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury |
title_fullStr | Integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury |
title_full_unstemmed | Integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury |
title_short | Integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury |
title_sort | integration of feature vectors from raw laboratory, medication and procedure names improves the precision and recall of models to predict postoperative mortality and acute kidney injury |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205878/ https://www.ncbi.nlm.nih.gov/pubmed/35715454 http://dx.doi.org/10.1038/s41598-022-13879-7 |
work_keys_str_mv | AT hoferiras integrationoffeaturevectorsfromrawlaboratorymedicationandprocedurenamesimprovestheprecisionandrecallofmodelstopredictpostoperativemortalityandacutekidneyinjury AT kupinamarina integrationoffeaturevectorsfromrawlaboratorymedicationandprocedurenamesimprovestheprecisionandrecallofmodelstopredictpostoperativemortalityandacutekidneyinjury AT laddaranlori integrationoffeaturevectorsfromrawlaboratorymedicationandprocedurenamesimprovestheprecisionandrecallofmodelstopredictpostoperativemortalityandacutekidneyinjury AT halperineran integrationoffeaturevectorsfromrawlaboratorymedicationandprocedurenamesimprovestheprecisionandrecallofmodelstopredictpostoperativemortalityandacutekidneyinjury |