Cargando…

Using self-supervised feature learning to improve the use of pulse oximeter signals to predict paediatric hospitalization

Background: The success of many machine learning applications depends on knowledge about the relationship between the input data and the task of interest (output), hindering the application of machine learning to novel tasks. End-to-end deep learning, which does not require intermediate feature engi...

Descripción completa

Detalles Bibliográficos
Autores principales: Mwaniki, Paul, Kamanu, Timothy, Akech, Samuel, Dunsmuir, Dustin, Ansermino, J. Mark, Eijkemans, M.J.C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280037/
https://www.ncbi.nlm.nih.gov/pubmed/37346816
http://dx.doi.org/10.12688/wellcomeopenres.17148.2
_version_ 1785060718945501184
author Mwaniki, Paul
Kamanu, Timothy
Akech, Samuel
Dunsmuir, Dustin
Ansermino, J. Mark
Eijkemans, M.J.C
author_facet Mwaniki, Paul
Kamanu, Timothy
Akech, Samuel
Dunsmuir, Dustin
Ansermino, J. Mark
Eijkemans, M.J.C
author_sort Mwaniki, Paul
collection PubMed
description Background: The success of many machine learning applications depends on knowledge about the relationship between the input data and the task of interest (output), hindering the application of machine learning to novel tasks. End-to-end deep learning, which does not require intermediate feature engineering, has been recommended to overcome this challenge but end-to-end deep learning models require large labelled training data sets often unavailable in many medical applications. In this study, we trained self-supervised learning (SSL) models for automatic feature extraction from raw photoplethysmography (PPG) obtained using a pulse oximeter, with the aim of predicting paediatric hospitalization.  Methods: We compared logistic regression models fitted using features extracted using SSL with models trained using both clinical and SSL features. In addition, we compared end-to-end deep learning models initialized randomly or using weights from the SSL models. We also compared the performance of SSL models trained on labelled data alone (n=1,031) with SSL trained using both labelled and unlabelled signals (n=7,578). Results: Logistic regression models were more predictive of hospitalization when trained on features extracted using labelled PPG signals only compared to SSL models trained on both labelled and unlabelled signals (AUC 0.83 vs 0.80). However, features extracted using SSL model trained on both labelled and unlabelled PPG signals were more predictive of hospitalization when concatenated with clinical features (AUC 0.89 vs 0.87). The end-to-end deep learning model had an AUC of 0.80 when initialized using the SSL model trained on all PPG signals, 0.77 when initialized using SSL trained on labelled data only, and 0.73 when initialized randomly. Conclusions: This study shows that SSL can extract features from PPG signals that are predictive of hospitalization or initialize end-to-end deep learning models. Furthermore, SSL can leverage larger unlabelled data sets to improve performance of models fitted using small labelled data sets.
format Online
Article
Text
id pubmed-10280037
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-102800372023-06-21 Using self-supervised feature learning to improve the use of pulse oximeter signals to predict paediatric hospitalization Mwaniki, Paul Kamanu, Timothy Akech, Samuel Dunsmuir, Dustin Ansermino, J. Mark Eijkemans, M.J.C Wellcome Open Res Research Article Background: The success of many machine learning applications depends on knowledge about the relationship between the input data and the task of interest (output), hindering the application of machine learning to novel tasks. End-to-end deep learning, which does not require intermediate feature engineering, has been recommended to overcome this challenge but end-to-end deep learning models require large labelled training data sets often unavailable in many medical applications. In this study, we trained self-supervised learning (SSL) models for automatic feature extraction from raw photoplethysmography (PPG) obtained using a pulse oximeter, with the aim of predicting paediatric hospitalization.  Methods: We compared logistic regression models fitted using features extracted using SSL with models trained using both clinical and SSL features. In addition, we compared end-to-end deep learning models initialized randomly or using weights from the SSL models. We also compared the performance of SSL models trained on labelled data alone (n=1,031) with SSL trained using both labelled and unlabelled signals (n=7,578). Results: Logistic regression models were more predictive of hospitalization when trained on features extracted using labelled PPG signals only compared to SSL models trained on both labelled and unlabelled signals (AUC 0.83 vs 0.80). However, features extracted using SSL model trained on both labelled and unlabelled PPG signals were more predictive of hospitalization when concatenated with clinical features (AUC 0.89 vs 0.87). The end-to-end deep learning model had an AUC of 0.80 when initialized using the SSL model trained on all PPG signals, 0.77 when initialized using SSL trained on labelled data only, and 0.73 when initialized randomly. Conclusions: This study shows that SSL can extract features from PPG signals that are predictive of hospitalization or initialize end-to-end deep learning models. Furthermore, SSL can leverage larger unlabelled data sets to improve performance of models fitted using small labelled data sets. F1000 Research Limited 2023-02-01 /pmc/articles/PMC10280037/ /pubmed/37346816 http://dx.doi.org/10.12688/wellcomeopenres.17148.2 Text en Copyright: © 2023 Mwaniki P et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Mwaniki, Paul
Kamanu, Timothy
Akech, Samuel
Dunsmuir, Dustin
Ansermino, J. Mark
Eijkemans, M.J.C
Using self-supervised feature learning to improve the use of pulse oximeter signals to predict paediatric hospitalization
title Using self-supervised feature learning to improve the use of pulse oximeter signals to predict paediatric hospitalization
title_full Using self-supervised feature learning to improve the use of pulse oximeter signals to predict paediatric hospitalization
title_fullStr Using self-supervised feature learning to improve the use of pulse oximeter signals to predict paediatric hospitalization
title_full_unstemmed Using self-supervised feature learning to improve the use of pulse oximeter signals to predict paediatric hospitalization
title_short Using self-supervised feature learning to improve the use of pulse oximeter signals to predict paediatric hospitalization
title_sort using self-supervised feature learning to improve the use of pulse oximeter signals to predict paediatric hospitalization
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280037/
https://www.ncbi.nlm.nih.gov/pubmed/37346816
http://dx.doi.org/10.12688/wellcomeopenres.17148.2
work_keys_str_mv AT mwanikipaul usingselfsupervisedfeaturelearningtoimprovetheuseofpulseoximetersignalstopredictpaediatrichospitalization
AT kamanutimothy usingselfsupervisedfeaturelearningtoimprovetheuseofpulseoximetersignalstopredictpaediatrichospitalization
AT akechsamuel usingselfsupervisedfeaturelearningtoimprovetheuseofpulseoximetersignalstopredictpaediatrichospitalization
AT dunsmuirdustin usingselfsupervisedfeaturelearningtoimprovetheuseofpulseoximetersignalstopredictpaediatrichospitalization
AT anserminojmark usingselfsupervisedfeaturelearningtoimprovetheuseofpulseoximetersignalstopredictpaediatrichospitalization
AT eijkemansmjc usingselfsupervisedfeaturelearningtoimprovetheuseofpulseoximetersignalstopredictpaediatrichospitalization