Cargando…

Data Quality Degradation on Prediction Models Generated From Continuous Activity and Heart Rate Monitoring: Exploratory Analysis Using Simulation

BACKGROUND: Limited data accuracy is often cited as a reason for caution in the integration of physiological data obtained from consumer-oriented wearable devices in care management pathways. The effect of decreasing accuracy on predictive models generated from these data has not been previously inv...

Descripción completa

Detalles Bibliográficos
Autores principales:	Hearn, Jason, Van den Eynde, Jef, Chinni, Bhargava, Cedars, Ari, Gottlieb Sen, Danielle, Kutty, Shelby, Manlhiot, Cedric
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2023
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10193221/ https://www.ncbi.nlm.nih.gov/pubmed/37133921 http://dx.doi.org/10.2196/40524

_version_	1785043794224218112
author	Hearn, Jason Van den Eynde, Jef Chinni, Bhargava Cedars, Ari Gottlieb Sen, Danielle Kutty, Shelby Manlhiot, Cedric
author_facet	Hearn, Jason Van den Eynde, Jef Chinni, Bhargava Cedars, Ari Gottlieb Sen, Danielle Kutty, Shelby Manlhiot, Cedric
author_sort	Hearn, Jason
collection	PubMed
description	BACKGROUND: Limited data accuracy is often cited as a reason for caution in the integration of physiological data obtained from consumer-oriented wearable devices in care management pathways. The effect of decreasing accuracy on predictive models generated from these data has not been previously investigated. OBJECTIVE: The aim of this study is to simulate the effect of data degradation on the reliability of prediction models generated from those data and thus determine the extent to which lower device accuracy might or might not limit their use in clinical settings. METHODS: Using the Multilevel Monitoring of Activity and Sleep in Healthy People data set, which includes continuous free-living step count and heart rate data from 21 healthy volunteers, we trained a random forest model to predict cardiac competence. Model performance in 75 perturbed data sets with increasing missingness, noisiness, bias, and a combination of all 3 perturbations was compared to model performance for the unperturbed data set. RESULTS: The unperturbed data set achieved a mean root mean square error (RMSE) of 0.079 (SD 0.001) in predicting cardiac competence index. For all types of perturbations, RMSE remained stable up to 20%-30% perturbation. Above this level, RMSE started increasing and reached the point at which the model was no longer predictive at 80% for noise, 50% for missingness, and 35% for the combination of all perturbations. Introducing systematic bias in the underlying data had no effect on RMSE. CONCLUSIONS: In this proof-of-concept study, the performance of predictive models for cardiac competence generated from continuously acquired physiological data was relatively stable with declining quality of the source data. As such, lower accuracy of consumer-oriented wearable devices might not be an absolute contraindication for their use in clinical prediction models.
format	Online Article Text
id	pubmed-10193221
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-101932212023-05-19 Data Quality Degradation on Prediction Models Generated From Continuous Activity and Heart Rate Monitoring: Exploratory Analysis Using Simulation Hearn, Jason Van den Eynde, Jef Chinni, Bhargava Cedars, Ari Gottlieb Sen, Danielle Kutty, Shelby Manlhiot, Cedric JMIR Cardio Original Paper BACKGROUND: Limited data accuracy is often cited as a reason for caution in the integration of physiological data obtained from consumer-oriented wearable devices in care management pathways. The effect of decreasing accuracy on predictive models generated from these data has not been previously investigated. OBJECTIVE: The aim of this study is to simulate the effect of data degradation on the reliability of prediction models generated from those data and thus determine the extent to which lower device accuracy might or might not limit their use in clinical settings. METHODS: Using the Multilevel Monitoring of Activity and Sleep in Healthy People data set, which includes continuous free-living step count and heart rate data from 21 healthy volunteers, we trained a random forest model to predict cardiac competence. Model performance in 75 perturbed data sets with increasing missingness, noisiness, bias, and a combination of all 3 perturbations was compared to model performance for the unperturbed data set. RESULTS: The unperturbed data set achieved a mean root mean square error (RMSE) of 0.079 (SD 0.001) in predicting cardiac competence index. For all types of perturbations, RMSE remained stable up to 20%-30% perturbation. Above this level, RMSE started increasing and reached the point at which the model was no longer predictive at 80% for noise, 50% for missingness, and 35% for the combination of all perturbations. Introducing systematic bias in the underlying data had no effect on RMSE. CONCLUSIONS: In this proof-of-concept study, the performance of predictive models for cardiac competence generated from continuously acquired physiological data was relatively stable with declining quality of the source data. As such, lower accuracy of consumer-oriented wearable devices might not be an absolute contraindication for their use in clinical prediction models. JMIR Publications 2023-05-03 /pmc/articles/PMC10193221/ /pubmed/37133921 http://dx.doi.org/10.2196/40524 Text en ©Jason Hearn, Jef Van den Eynde, Bhargava Chinni, Ari Cedars, Danielle Gottlieb Sen, Shelby Kutty, Cedric Manlhiot. Originally published in JMIR Cardio (https://cardio.jmir.org), 03.05.2023. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Cardio, is properly cited. The complete bibliographic information, a link to the original publication on https://cardio.jmir.org, as well as this copyright and license information must be included.
spellingShingle	Original Paper Hearn, Jason Van den Eynde, Jef Chinni, Bhargava Cedars, Ari Gottlieb Sen, Danielle Kutty, Shelby Manlhiot, Cedric Data Quality Degradation on Prediction Models Generated From Continuous Activity and Heart Rate Monitoring: Exploratory Analysis Using Simulation
title	Data Quality Degradation on Prediction Models Generated From Continuous Activity and Heart Rate Monitoring: Exploratory Analysis Using Simulation
title_full	Data Quality Degradation on Prediction Models Generated From Continuous Activity and Heart Rate Monitoring: Exploratory Analysis Using Simulation
title_fullStr	Data Quality Degradation on Prediction Models Generated From Continuous Activity and Heart Rate Monitoring: Exploratory Analysis Using Simulation
title_full_unstemmed	Data Quality Degradation on Prediction Models Generated From Continuous Activity and Heart Rate Monitoring: Exploratory Analysis Using Simulation
title_short	Data Quality Degradation on Prediction Models Generated From Continuous Activity and Heart Rate Monitoring: Exploratory Analysis Using Simulation
title_sort	data quality degradation on prediction models generated from continuous activity and heart rate monitoring: exploratory analysis using simulation
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10193221/ https://www.ncbi.nlm.nih.gov/pubmed/37133921 http://dx.doi.org/10.2196/40524
work_keys_str_mv	AT hearnjason dataqualitydegradationonpredictionmodelsgeneratedfromcontinuousactivityandheartratemonitoringexploratoryanalysisusingsimulation AT vandeneyndejef dataqualitydegradationonpredictionmodelsgeneratedfromcontinuousactivityandheartratemonitoringexploratoryanalysisusingsimulation AT chinnibhargava dataqualitydegradationonpredictionmodelsgeneratedfromcontinuousactivityandheartratemonitoringexploratoryanalysisusingsimulation AT cedarsari dataqualitydegradationonpredictionmodelsgeneratedfromcontinuousactivityandheartratemonitoringexploratoryanalysisusingsimulation AT gottliebsendanielle dataqualitydegradationonpredictionmodelsgeneratedfromcontinuousactivityandheartratemonitoringexploratoryanalysisusingsimulation AT kuttyshelby dataqualitydegradationonpredictionmodelsgeneratedfromcontinuousactivityandheartratemonitoringexploratoryanalysisusingsimulation AT manlhiotcedric dataqualitydegradationonpredictionmodelsgeneratedfromcontinuousactivityandheartratemonitoringexploratoryanalysisusingsimulation

Data Quality Degradation on Prediction Models Generated From Continuous Activity and Heart Rate Monitoring: Exploratory Analysis Using Simulation

Ejemplares similares