Cargando…
Using Wearable Activity Trackers to Predict Type 2 Diabetes: Machine Learning–Based Cross-sectional Study of the UK Biobank Accelerometer Cohort
BACKGROUND: Between 2013 and 2015, the UK Biobank collected accelerometer traces from 103,712 volunteers aged between 40 and 69 years using wrist-worn triaxial accelerometers for 1 week. This data set has been used in the past to verify that individuals with chronic diseases exhibit reduced activity...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8080299/ https://www.ncbi.nlm.nih.gov/pubmed/33739298 http://dx.doi.org/10.2196/23364 |
_version_ | 1783685397835939840 |
---|---|
author | Lam, Benjamin Catt, Michael Cassidy, Sophie Bacardit, Jaume Darke, Philip Butterfield, Sam Alshabrawy, Ossama Trenell, Michael Missier, Paolo |
author_facet | Lam, Benjamin Catt, Michael Cassidy, Sophie Bacardit, Jaume Darke, Philip Butterfield, Sam Alshabrawy, Ossama Trenell, Michael Missier, Paolo |
author_sort | Lam, Benjamin |
collection | PubMed |
description | BACKGROUND: Between 2013 and 2015, the UK Biobank collected accelerometer traces from 103,712 volunteers aged between 40 and 69 years using wrist-worn triaxial accelerometers for 1 week. This data set has been used in the past to verify that individuals with chronic diseases exhibit reduced activity levels compared with healthy populations. However, the data set is likely to be noisy, as the devices were allocated to participants without a set of inclusion criteria, and the traces reflect free-living conditions. OBJECTIVE: This study aims to determine the extent to which accelerometer traces can be used to distinguish individuals with type 2 diabetes (T2D) from normoglycemic controls and to quantify their limitations. METHODS: Machine learning classifiers were trained using different feature sets to segregate individuals with T2D from normoglycemic individuals. Multiple criteria, based on a combination of self-assessment UK Biobank variables and primary care health records linked to UK Biobank participants, were used to identify 3103 individuals with T2D in this population. The remaining nondiabetic 19,852 participants were further scored on their physical activity impairment severity based on other conditions found in their primary care data, and those deemed likely physically impaired at the time were excluded. Physical activity features were first extracted from the raw accelerometer traces data set for each participant using an algorithm that extends the previously developed Biobank Accelerometry Analysis toolkit from Oxford University. These features were complemented by a selected collection of sociodemographic and lifestyle features available from UK Biobank. RESULTS: We tested 3 types of classifiers, with an area under the receiver operating characteristic curve (AUC) close to 0.86 (95% CI 0.85-0.87) for all 3 classifiers and F1 scores in the range of 0.80-0.82 for T2D-positive individuals and 0.73-0.74 for T2D-negative controls. Results obtained using nonphysically impaired controls were compared with highly physically impaired controls to test the hypothesis that nondiabetic conditions reduce classifier performance. Models built using a training set that included highly impaired controls with other conditions had worse performance (AUC 0.75-0.77; 95% CI 0.74-0.78; F1 scores in the range of 0.76-0.77 for T2D positives and 0.63-0.65 for controls). CONCLUSIONS: Granular measures of free-living physical activity can be used to successfully train machine learning models that are able to discriminate between individuals with T2D and normoglycemic controls, although with limitations because of the intrinsic noise in the data sets. From a broader clinical perspective, these findings motivate further research into the use of physical activity traces as a means of screening individuals at risk of diabetes and for early detection, in conjunction with routinely used risk scores, provided that appropriate quality control is enforced on the data collection protocol. |
format | Online Article Text |
id | pubmed-8080299 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-80802992021-05-06 Using Wearable Activity Trackers to Predict Type 2 Diabetes: Machine Learning–Based Cross-sectional Study of the UK Biobank Accelerometer Cohort Lam, Benjamin Catt, Michael Cassidy, Sophie Bacardit, Jaume Darke, Philip Butterfield, Sam Alshabrawy, Ossama Trenell, Michael Missier, Paolo JMIR Diabetes Original Paper BACKGROUND: Between 2013 and 2015, the UK Biobank collected accelerometer traces from 103,712 volunteers aged between 40 and 69 years using wrist-worn triaxial accelerometers for 1 week. This data set has been used in the past to verify that individuals with chronic diseases exhibit reduced activity levels compared with healthy populations. However, the data set is likely to be noisy, as the devices were allocated to participants without a set of inclusion criteria, and the traces reflect free-living conditions. OBJECTIVE: This study aims to determine the extent to which accelerometer traces can be used to distinguish individuals with type 2 diabetes (T2D) from normoglycemic controls and to quantify their limitations. METHODS: Machine learning classifiers were trained using different feature sets to segregate individuals with T2D from normoglycemic individuals. Multiple criteria, based on a combination of self-assessment UK Biobank variables and primary care health records linked to UK Biobank participants, were used to identify 3103 individuals with T2D in this population. The remaining nondiabetic 19,852 participants were further scored on their physical activity impairment severity based on other conditions found in their primary care data, and those deemed likely physically impaired at the time were excluded. Physical activity features were first extracted from the raw accelerometer traces data set for each participant using an algorithm that extends the previously developed Biobank Accelerometry Analysis toolkit from Oxford University. These features were complemented by a selected collection of sociodemographic and lifestyle features available from UK Biobank. RESULTS: We tested 3 types of classifiers, with an area under the receiver operating characteristic curve (AUC) close to 0.86 (95% CI 0.85-0.87) for all 3 classifiers and F1 scores in the range of 0.80-0.82 for T2D-positive individuals and 0.73-0.74 for T2D-negative controls. Results obtained using nonphysically impaired controls were compared with highly physically impaired controls to test the hypothesis that nondiabetic conditions reduce classifier performance. Models built using a training set that included highly impaired controls with other conditions had worse performance (AUC 0.75-0.77; 95% CI 0.74-0.78; F1 scores in the range of 0.76-0.77 for T2D positives and 0.63-0.65 for controls). CONCLUSIONS: Granular measures of free-living physical activity can be used to successfully train machine learning models that are able to discriminate between individuals with T2D and normoglycemic controls, although with limitations because of the intrinsic noise in the data sets. From a broader clinical perspective, these findings motivate further research into the use of physical activity traces as a means of screening individuals at risk of diabetes and for early detection, in conjunction with routinely used risk scores, provided that appropriate quality control is enforced on the data collection protocol. JMIR Publications 2021-03-19 /pmc/articles/PMC8080299/ /pubmed/33739298 http://dx.doi.org/10.2196/23364 Text en ©Benjamin Lam, Michael Catt, Sophie Cassidy, Jaume Bacardit, Philip Darke, Sam Butterfield, Ossama Alshabrawy, Michael Trenell, Paolo Missier. Originally published in JMIR Diabetes (http://diabetes.jmir.org), 19.03.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Diabetes, is properly cited. The complete bibliographic information, a link to the original publication on http://diabetes.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Lam, Benjamin Catt, Michael Cassidy, Sophie Bacardit, Jaume Darke, Philip Butterfield, Sam Alshabrawy, Ossama Trenell, Michael Missier, Paolo Using Wearable Activity Trackers to Predict Type 2 Diabetes: Machine Learning–Based Cross-sectional Study of the UK Biobank Accelerometer Cohort |
title | Using Wearable Activity Trackers to Predict Type 2 Diabetes: Machine Learning–Based Cross-sectional Study of the UK Biobank Accelerometer Cohort |
title_full | Using Wearable Activity Trackers to Predict Type 2 Diabetes: Machine Learning–Based Cross-sectional Study of the UK Biobank Accelerometer Cohort |
title_fullStr | Using Wearable Activity Trackers to Predict Type 2 Diabetes: Machine Learning–Based Cross-sectional Study of the UK Biobank Accelerometer Cohort |
title_full_unstemmed | Using Wearable Activity Trackers to Predict Type 2 Diabetes: Machine Learning–Based Cross-sectional Study of the UK Biobank Accelerometer Cohort |
title_short | Using Wearable Activity Trackers to Predict Type 2 Diabetes: Machine Learning–Based Cross-sectional Study of the UK Biobank Accelerometer Cohort |
title_sort | using wearable activity trackers to predict type 2 diabetes: machine learning–based cross-sectional study of the uk biobank accelerometer cohort |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8080299/ https://www.ncbi.nlm.nih.gov/pubmed/33739298 http://dx.doi.org/10.2196/23364 |
work_keys_str_mv | AT lambenjamin usingwearableactivitytrackerstopredicttype2diabetesmachinelearningbasedcrosssectionalstudyoftheukbiobankaccelerometercohort AT cattmichael usingwearableactivitytrackerstopredicttype2diabetesmachinelearningbasedcrosssectionalstudyoftheukbiobankaccelerometercohort AT cassidysophie usingwearableactivitytrackerstopredicttype2diabetesmachinelearningbasedcrosssectionalstudyoftheukbiobankaccelerometercohort AT bacarditjaume usingwearableactivitytrackerstopredicttype2diabetesmachinelearningbasedcrosssectionalstudyoftheukbiobankaccelerometercohort AT darkephilip usingwearableactivitytrackerstopredicttype2diabetesmachinelearningbasedcrosssectionalstudyoftheukbiobankaccelerometercohort AT butterfieldsam usingwearableactivitytrackerstopredicttype2diabetesmachinelearningbasedcrosssectionalstudyoftheukbiobankaccelerometercohort AT alshabrawyossama usingwearableactivitytrackerstopredicttype2diabetesmachinelearningbasedcrosssectionalstudyoftheukbiobankaccelerometercohort AT trenellmichael usingwearableactivitytrackerstopredicttype2diabetesmachinelearningbasedcrosssectionalstudyoftheukbiobankaccelerometercohort AT missierpaolo usingwearableactivitytrackerstopredicttype2diabetesmachinelearningbasedcrosssectionalstudyoftheukbiobankaccelerometercohort |