Cargando…

Increasing efficiency of SVMp+ for handling missing values in healthcare prediction

Missing data presents a challenge for machine learning applications specifically when utilizing electronic health records to develop clinical decision support systems. The lack of these values is due in part to the complex nature of clinical data in which the content is personalized to each patient....

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yufeng, Gao, Zijun, Wittrup, Emily, Gryak, Jonathan, Najarian, Kayvan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10309617/
https://www.ncbi.nlm.nih.gov/pubmed/37384608
http://dx.doi.org/10.1371/journal.pdig.0000281
_version_ 1785066476321898496
author Zhang, Yufeng
Gao, Zijun
Wittrup, Emily
Gryak, Jonathan
Najarian, Kayvan
author_facet Zhang, Yufeng
Gao, Zijun
Wittrup, Emily
Gryak, Jonathan
Najarian, Kayvan
author_sort Zhang, Yufeng
collection PubMed
description Missing data presents a challenge for machine learning applications specifically when utilizing electronic health records to develop clinical decision support systems. The lack of these values is due in part to the complex nature of clinical data in which the content is personalized to each patient. Several methods have been developed to handle this issue, such as imputation or complete case analysis, but their limitations restrict the solidity of findings. However, recent studies have explored how using some features as fully available privileged information can increase model performance including in SVM. Building on this insight, we propose a computationally efficient kernel SVM-based framework (l(2)-SVMp+) that leverages partially available privileged information to guide model construction. Our experiments validated the superiority of l(2)-SVMp+ over common approaches for handling missingness and previous implementations of SVMp+ in both digit recognition, disease classification and patient readmission prediction tasks. The performance improves as the percentage of available privileged information increases. Our results showcase the capability of l(2)-SVMp+ to handle incomplete but important features in real-world medical applications, surpassing traditional SVMs that lack privileged information. Additionally, l(2)-SVMp+ achieves comparable or superior model performance compared to imputed privileged features.
format Online
Article
Text
id pubmed-10309617
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-103096172023-06-30 Increasing efficiency of SVMp+ for handling missing values in healthcare prediction Zhang, Yufeng Gao, Zijun Wittrup, Emily Gryak, Jonathan Najarian, Kayvan PLOS Digit Health Research Article Missing data presents a challenge for machine learning applications specifically when utilizing electronic health records to develop clinical decision support systems. The lack of these values is due in part to the complex nature of clinical data in which the content is personalized to each patient. Several methods have been developed to handle this issue, such as imputation or complete case analysis, but their limitations restrict the solidity of findings. However, recent studies have explored how using some features as fully available privileged information can increase model performance including in SVM. Building on this insight, we propose a computationally efficient kernel SVM-based framework (l(2)-SVMp+) that leverages partially available privileged information to guide model construction. Our experiments validated the superiority of l(2)-SVMp+ over common approaches for handling missingness and previous implementations of SVMp+ in both digit recognition, disease classification and patient readmission prediction tasks. The performance improves as the percentage of available privileged information increases. Our results showcase the capability of l(2)-SVMp+ to handle incomplete but important features in real-world medical applications, surpassing traditional SVMs that lack privileged information. Additionally, l(2)-SVMp+ achieves comparable or superior model performance compared to imputed privileged features. Public Library of Science 2023-06-29 /pmc/articles/PMC10309617/ /pubmed/37384608 http://dx.doi.org/10.1371/journal.pdig.0000281 Text en © 2023 Zhang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zhang, Yufeng
Gao, Zijun
Wittrup, Emily
Gryak, Jonathan
Najarian, Kayvan
Increasing efficiency of SVMp+ for handling missing values in healthcare prediction
title Increasing efficiency of SVMp+ for handling missing values in healthcare prediction
title_full Increasing efficiency of SVMp+ for handling missing values in healthcare prediction
title_fullStr Increasing efficiency of SVMp+ for handling missing values in healthcare prediction
title_full_unstemmed Increasing efficiency of SVMp+ for handling missing values in healthcare prediction
title_short Increasing efficiency of SVMp+ for handling missing values in healthcare prediction
title_sort increasing efficiency of svmp+ for handling missing values in healthcare prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10309617/
https://www.ncbi.nlm.nih.gov/pubmed/37384608
http://dx.doi.org/10.1371/journal.pdig.0000281
work_keys_str_mv AT zhangyufeng increasingefficiencyofsvmpforhandlingmissingvaluesinhealthcareprediction
AT gaozijun increasingefficiencyofsvmpforhandlingmissingvaluesinhealthcareprediction
AT wittrupemily increasingefficiencyofsvmpforhandlingmissingvaluesinhealthcareprediction
AT gryakjonathan increasingefficiencyofsvmpforhandlingmissingvaluesinhealthcareprediction
AT najariankayvan increasingefficiencyofsvmpforhandlingmissingvaluesinhealthcareprediction