Cargando…

What Are Good Situations for Running? A Machine Learning Study Using Mobile and Geographical Data

Running is a popular form of physical activity. Personal, social, and environmental determinants influence the engagement of the individual. To get insight in the relation between running behavior and external situations for different types of users, we carried out an extensive data mining study on...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Shihan, Scheider, Simon, Sporrel, Karlijn, Deutekom, Marije, Timmer, Joris, Kröse, Ben
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7820721/
https://www.ncbi.nlm.nih.gov/pubmed/33490006
http://dx.doi.org/10.3389/fpubh.2020.536370
_version_ 1783639273344335872
author Wang, Shihan
Scheider, Simon
Sporrel, Karlijn
Deutekom, Marije
Timmer, Joris
Kröse, Ben
author_facet Wang, Shihan
Scheider, Simon
Sporrel, Karlijn
Deutekom, Marije
Timmer, Joris
Kröse, Ben
author_sort Wang, Shihan
collection PubMed
description Running is a popular form of physical activity. Personal, social, and environmental determinants influence the engagement of the individual. To get insight in the relation between running behavior and external situations for different types of users, we carried out an extensive data mining study on large-scale datasets. We combined 4 years of historical running data (collected by a mobile exercise application from over 10K participants) with weather, topographical and demographical datasets. We introduce weighted frequent item mining for the analysis of the data. In this way, we capture temporal and environmental situations that frequently associate with different running performances. The results show that specific temporal and environmental situations (hour in a day, day in a week, temperature, distance to residential areas, and population density) influence the running performance of users more than other situational features. Hierarchical agglomerative clustering on the running data is used to split runners in two clusters (with sustained and less sustained running behavior). We compared the two groups of runners and found that runners with less sustained behavior are more sensitive to the environmental situations (especially several weather and location related features, such as temperature, weather type, distance to the nearest park) than regular runners. Further analysis focused on the situational features for the less sustained runners. Results show that specific feature values correspond to a better or worse running distance. Not only the influence of individual features was examined but also the interplay between features. Our findings provide important empirical evidence that the role of external situations in the running behavior of individuals can be derived from analysis of the combined historical datasets. This opens up a large potential to take those situations specifically into consideration when supporting individuals which show less sustained behavior.
format Online
Article
Text
id pubmed-7820721
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-78207212021-01-23 What Are Good Situations for Running? A Machine Learning Study Using Mobile and Geographical Data Wang, Shihan Scheider, Simon Sporrel, Karlijn Deutekom, Marije Timmer, Joris Kröse, Ben Front Public Health Public Health Running is a popular form of physical activity. Personal, social, and environmental determinants influence the engagement of the individual. To get insight in the relation between running behavior and external situations for different types of users, we carried out an extensive data mining study on large-scale datasets. We combined 4 years of historical running data (collected by a mobile exercise application from over 10K participants) with weather, topographical and demographical datasets. We introduce weighted frequent item mining for the analysis of the data. In this way, we capture temporal and environmental situations that frequently associate with different running performances. The results show that specific temporal and environmental situations (hour in a day, day in a week, temperature, distance to residential areas, and population density) influence the running performance of users more than other situational features. Hierarchical agglomerative clustering on the running data is used to split runners in two clusters (with sustained and less sustained running behavior). We compared the two groups of runners and found that runners with less sustained behavior are more sensitive to the environmental situations (especially several weather and location related features, such as temperature, weather type, distance to the nearest park) than regular runners. Further analysis focused on the situational features for the less sustained runners. Results show that specific feature values correspond to a better or worse running distance. Not only the influence of individual features was examined but also the interplay between features. Our findings provide important empirical evidence that the role of external situations in the running behavior of individuals can be derived from analysis of the combined historical datasets. This opens up a large potential to take those situations specifically into consideration when supporting individuals which show less sustained behavior. Frontiers Media S.A. 2021-01-11 /pmc/articles/PMC7820721/ /pubmed/33490006 http://dx.doi.org/10.3389/fpubh.2020.536370 Text en Copyright © 2021 Wang, Scheider, Sporrel, Deutekom, Timmer and Kröse. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Public Health
Wang, Shihan
Scheider, Simon
Sporrel, Karlijn
Deutekom, Marije
Timmer, Joris
Kröse, Ben
What Are Good Situations for Running? A Machine Learning Study Using Mobile and Geographical Data
title What Are Good Situations for Running? A Machine Learning Study Using Mobile and Geographical Data
title_full What Are Good Situations for Running? A Machine Learning Study Using Mobile and Geographical Data
title_fullStr What Are Good Situations for Running? A Machine Learning Study Using Mobile and Geographical Data
title_full_unstemmed What Are Good Situations for Running? A Machine Learning Study Using Mobile and Geographical Data
title_short What Are Good Situations for Running? A Machine Learning Study Using Mobile and Geographical Data
title_sort what are good situations for running? a machine learning study using mobile and geographical data
topic Public Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7820721/
https://www.ncbi.nlm.nih.gov/pubmed/33490006
http://dx.doi.org/10.3389/fpubh.2020.536370
work_keys_str_mv AT wangshihan whataregoodsituationsforrunningamachinelearningstudyusingmobileandgeographicaldata
AT scheidersimon whataregoodsituationsforrunningamachinelearningstudyusingmobileandgeographicaldata
AT sporrelkarlijn whataregoodsituationsforrunningamachinelearningstudyusingmobileandgeographicaldata
AT deutekommarije whataregoodsituationsforrunningamachinelearningstudyusingmobileandgeographicaldata
AT timmerjoris whataregoodsituationsforrunningamachinelearningstudyusingmobileandgeographicaldata
AT kroseben whataregoodsituationsforrunningamachinelearningstudyusingmobileandgeographicaldata