Cargando…

Intensive care photoplethysmogram datasets and machine-learning for blood pressure estimation: Generalization not guarantied

The large MIMIC waveform dataset, sourced from intensive care units, has been used extensively for the development of Photoplethysmography (PPG) based blood pressure (BP) estimation algorithms. Yet, because the data comes from patients in severe conditions—often under the effect of drugs—it is regul...

Descripción completa

Detalles Bibliográficos
Autores principales: Weber-Boisvert, Guillaume, Gosselin, Benoit, Sandberg, Frida
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10017741/
https://www.ncbi.nlm.nih.gov/pubmed/36935753
http://dx.doi.org/10.3389/fphys.2023.1126957
_version_ 1784907656752791552
author Weber-Boisvert, Guillaume
Gosselin, Benoit
Sandberg, Frida
author_facet Weber-Boisvert, Guillaume
Gosselin, Benoit
Sandberg, Frida
author_sort Weber-Boisvert, Guillaume
collection PubMed
description The large MIMIC waveform dataset, sourced from intensive care units, has been used extensively for the development of Photoplethysmography (PPG) based blood pressure (BP) estimation algorithms. Yet, because the data comes from patients in severe conditions—often under the effect of drugs—it is regularly noted that the relationship between BP and PPG signal characteristics may be anomalous, a claim that we investigate here. A sample of 12,000 records from the MIMIC waveform dataset was stacked up against the 219 records of the PPG-BP dataset, an alternative public dataset obtained under controlled experimental conditions. The distribution of systolic and diastolic BP data and 31 PPG pulse morphological features was first compared between datasets. Then, the correlation between features and BP, as well as between the features themselves, was analysed. Finally, regression models were trained for each dataset and validated against the other. Statistical analysis showed significant [Formula: see text] differences between the datasets in diastolic BP and in 20 out of 31 features when adjusting for heart rate differences. The eight features showing the highest rank correlation [Formula: see text] to systolic BP in PPG-BP all displayed muted correlation levels [Formula: see text] in MIMIC. Regression tests showed twice higher baseline predictive power with PPG-BP than with MIMIC. Cross-dataset regression displayed a practically complete loss of predictive power for all models. The differences between the MIMIC and PPG-BP dataset exposed in this study suggest that BP estimation models based on the MIMIC dataset have reduced predictive power on the general population.
format Online
Article
Text
id pubmed-10017741
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-100177412023-03-17 Intensive care photoplethysmogram datasets and machine-learning for blood pressure estimation: Generalization not guarantied Weber-Boisvert, Guillaume Gosselin, Benoit Sandberg, Frida Front Physiol Physiology The large MIMIC waveform dataset, sourced from intensive care units, has been used extensively for the development of Photoplethysmography (PPG) based blood pressure (BP) estimation algorithms. Yet, because the data comes from patients in severe conditions—often under the effect of drugs—it is regularly noted that the relationship between BP and PPG signal characteristics may be anomalous, a claim that we investigate here. A sample of 12,000 records from the MIMIC waveform dataset was stacked up against the 219 records of the PPG-BP dataset, an alternative public dataset obtained under controlled experimental conditions. The distribution of systolic and diastolic BP data and 31 PPG pulse morphological features was first compared between datasets. Then, the correlation between features and BP, as well as between the features themselves, was analysed. Finally, regression models were trained for each dataset and validated against the other. Statistical analysis showed significant [Formula: see text] differences between the datasets in diastolic BP and in 20 out of 31 features when adjusting for heart rate differences. The eight features showing the highest rank correlation [Formula: see text] to systolic BP in PPG-BP all displayed muted correlation levels [Formula: see text] in MIMIC. Regression tests showed twice higher baseline predictive power with PPG-BP than with MIMIC. Cross-dataset regression displayed a practically complete loss of predictive power for all models. The differences between the MIMIC and PPG-BP dataset exposed in this study suggest that BP estimation models based on the MIMIC dataset have reduced predictive power on the general population. Frontiers Media S.A. 2023-03-02 /pmc/articles/PMC10017741/ /pubmed/36935753 http://dx.doi.org/10.3389/fphys.2023.1126957 Text en Copyright © 2023 Weber-Boisvert, Gosselin and Sandberg. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Physiology
Weber-Boisvert, Guillaume
Gosselin, Benoit
Sandberg, Frida
Intensive care photoplethysmogram datasets and machine-learning for blood pressure estimation: Generalization not guarantied
title Intensive care photoplethysmogram datasets and machine-learning for blood pressure estimation: Generalization not guarantied
title_full Intensive care photoplethysmogram datasets and machine-learning for blood pressure estimation: Generalization not guarantied
title_fullStr Intensive care photoplethysmogram datasets and machine-learning for blood pressure estimation: Generalization not guarantied
title_full_unstemmed Intensive care photoplethysmogram datasets and machine-learning for blood pressure estimation: Generalization not guarantied
title_short Intensive care photoplethysmogram datasets and machine-learning for blood pressure estimation: Generalization not guarantied
title_sort intensive care photoplethysmogram datasets and machine-learning for blood pressure estimation: generalization not guarantied
topic Physiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10017741/
https://www.ncbi.nlm.nih.gov/pubmed/36935753
http://dx.doi.org/10.3389/fphys.2023.1126957
work_keys_str_mv AT weberboisvertguillaume intensivecarephotoplethysmogramdatasetsandmachinelearningforbloodpressureestimationgeneralizationnotguarantied
AT gosselinbenoit intensivecarephotoplethysmogramdatasetsandmachinelearningforbloodpressureestimationgeneralizationnotguarantied
AT sandbergfrida intensivecarephotoplethysmogramdatasetsandmachinelearningforbloodpressureestimationgeneralizationnotguarantied