Cargando…

Developing a Physical Activity Ontology to Support the Interoperability of Physical Activity Data

BACKGROUND: Physical activity data provides important information on disease onset, progression, and treatment outcomes. Although analyzing physical activity data in conjunction with other clinical and microbiological data will lead to new insights crucial for improving human health, it has been ham...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Hyeoneui, Mentzer, Jessica, Taira, Ricky
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6658272/
https://www.ncbi.nlm.nih.gov/pubmed/31012864
http://dx.doi.org/10.2196/12776
_version_ 1783438935718887424
author Kim, Hyeoneui
Mentzer, Jessica
Taira, Ricky
author_facet Kim, Hyeoneui
Mentzer, Jessica
Taira, Ricky
author_sort Kim, Hyeoneui
collection PubMed
description BACKGROUND: Physical activity data provides important information on disease onset, progression, and treatment outcomes. Although analyzing physical activity data in conjunction with other clinical and microbiological data will lead to new insights crucial for improving human health, it has been hampered partly because of the large variations in the way the data are collected and presented. OBJECTIVE: The aim of this study was to develop a Physical Activity Ontology (PACO) to support structuring and standardizing heterogeneous descriptions of physical activities. METHODS: We prepared a corpus of 1140 unique sentences collected from various physical activity questionnaires and scales as well as existing standardized terminologies and ontologies. We extracted concepts relevant to physical activity from the corpus using a natural language processing toolkit called Multipurpose Text Processing Tool. The target concepts were formalized into an ontology using Protégé (version 4). Evaluation of PACO was performed to ensure logical and structural consistency as well as adherence to the best practice principles of building an ontology. A use case application of PACO was demonstrated by structuring and standardizing 36 exercise habit statements and then automatically classifying them to a defined class of either sufficiently active or insufficiently active using FaCT++, an ontology reasoner available in Protégé. RESULTS: PACO was constructed using 268 unique concepts extracted from the questionnaires and assessment scales. PACO contains 225 classes including 9 defined classes, 20 object properties, 1 data property, and 23 instances (excluding 36 exercise statements). The maximum depth of classes is 4, and the maximum number of siblings is 38. The evaluations with ontology auditing tools confirmed that PACO is structurally and logically consistent and satisfies the majority of the best practice rules of ontology authoring. We showed in a small sample of 36 exercise habit statements that we could formally represent them using PACO concepts and object properties. The formal representation was used to infer a patient activity status category of sufficiently active or insufficiently active using the FaCT++ reasoner. CONCLUSIONS: As a first step toward standardizing and structuring heterogeneous descriptions of physical activities for integrative data analyses, PACO was constructed based on the concepts collected from physical activity questionnaires and assessment scales. PACO was evaluated to be structurally consistent and compliant to ontology authoring principles. PACO was also demonstrated to be potentially useful in standardizing heterogeneous physical activity descriptions and classifying them into clinically meaningful categories that reflect adequacy of exercise.
format Online
Article
Text
id pubmed-6658272
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-66582722019-07-31 Developing a Physical Activity Ontology to Support the Interoperability of Physical Activity Data Kim, Hyeoneui Mentzer, Jessica Taira, Ricky J Med Internet Res Original Paper BACKGROUND: Physical activity data provides important information on disease onset, progression, and treatment outcomes. Although analyzing physical activity data in conjunction with other clinical and microbiological data will lead to new insights crucial for improving human health, it has been hampered partly because of the large variations in the way the data are collected and presented. OBJECTIVE: The aim of this study was to develop a Physical Activity Ontology (PACO) to support structuring and standardizing heterogeneous descriptions of physical activities. METHODS: We prepared a corpus of 1140 unique sentences collected from various physical activity questionnaires and scales as well as existing standardized terminologies and ontologies. We extracted concepts relevant to physical activity from the corpus using a natural language processing toolkit called Multipurpose Text Processing Tool. The target concepts were formalized into an ontology using Protégé (version 4). Evaluation of PACO was performed to ensure logical and structural consistency as well as adherence to the best practice principles of building an ontology. A use case application of PACO was demonstrated by structuring and standardizing 36 exercise habit statements and then automatically classifying them to a defined class of either sufficiently active or insufficiently active using FaCT++, an ontology reasoner available in Protégé. RESULTS: PACO was constructed using 268 unique concepts extracted from the questionnaires and assessment scales. PACO contains 225 classes including 9 defined classes, 20 object properties, 1 data property, and 23 instances (excluding 36 exercise statements). The maximum depth of classes is 4, and the maximum number of siblings is 38. The evaluations with ontology auditing tools confirmed that PACO is structurally and logically consistent and satisfies the majority of the best practice rules of ontology authoring. We showed in a small sample of 36 exercise habit statements that we could formally represent them using PACO concepts and object properties. The formal representation was used to infer a patient activity status category of sufficiently active or insufficiently active using the FaCT++ reasoner. CONCLUSIONS: As a first step toward standardizing and structuring heterogeneous descriptions of physical activities for integrative data analyses, PACO was constructed based on the concepts collected from physical activity questionnaires and assessment scales. PACO was evaluated to be structurally consistent and compliant to ontology authoring principles. PACO was also demonstrated to be potentially useful in standardizing heterogeneous physical activity descriptions and classifying them into clinically meaningful categories that reflect adequacy of exercise. JMIR Publications 2019-04-23 /pmc/articles/PMC6658272/ /pubmed/31012864 http://dx.doi.org/10.2196/12776 Text en ©Hyeoneui Kim, Jessica Mentzer, Ricky Taira. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 23.04.2019. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Kim, Hyeoneui
Mentzer, Jessica
Taira, Ricky
Developing a Physical Activity Ontology to Support the Interoperability of Physical Activity Data
title Developing a Physical Activity Ontology to Support the Interoperability of Physical Activity Data
title_full Developing a Physical Activity Ontology to Support the Interoperability of Physical Activity Data
title_fullStr Developing a Physical Activity Ontology to Support the Interoperability of Physical Activity Data
title_full_unstemmed Developing a Physical Activity Ontology to Support the Interoperability of Physical Activity Data
title_short Developing a Physical Activity Ontology to Support the Interoperability of Physical Activity Data
title_sort developing a physical activity ontology to support the interoperability of physical activity data
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6658272/
https://www.ncbi.nlm.nih.gov/pubmed/31012864
http://dx.doi.org/10.2196/12776
work_keys_str_mv AT kimhyeoneui developingaphysicalactivityontologytosupporttheinteroperabilityofphysicalactivitydata
AT mentzerjessica developingaphysicalactivityontologytosupporttheinteroperabilityofphysicalactivitydata
AT tairaricky developingaphysicalactivityontologytosupporttheinteroperabilityofphysicalactivitydata