Cargando…

Representation of Time-Relevant Common Data Elements in the Cancer Data Standards Repository: Statistical Evaluation of an Ontological Approach

BACKGROUND: Today, there is an increasing need to centralize and standardize electronic health data within clinical research as the volume of data continues to balloon. Domain-specific common data elements (CDEs) are emerging as a standard approach to clinical research data capturing and reporting....

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Henry W, Du, Jingcheng, Song, Hsing-Yi, Liu, Xiangyu, Jiang, Guoqian, Tao, Cui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5843793/
https://www.ncbi.nlm.nih.gov/pubmed/29472179
http://dx.doi.org/10.2196/medinform.8175
_version_ 1783305139933675520
author Chen, Henry W
Du, Jingcheng
Song, Hsing-Yi
Liu, Xiangyu
Jiang, Guoqian
Tao, Cui
author_facet Chen, Henry W
Du, Jingcheng
Song, Hsing-Yi
Liu, Xiangyu
Jiang, Guoqian
Tao, Cui
author_sort Chen, Henry W
collection PubMed
description BACKGROUND: Today, there is an increasing need to centralize and standardize electronic health data within clinical research as the volume of data continues to balloon. Domain-specific common data elements (CDEs) are emerging as a standard approach to clinical research data capturing and reporting. Recent efforts to standardize clinical study CDEs have been of great benefit in facilitating data integration and data sharing. The importance of the temporal dimension of clinical research studies has been well recognized; however, very few studies have focused on the formal representation of temporal constraints and temporal relationships within clinical research data in the biomedical research community. In particular, temporal information can be extremely powerful to enable high-quality cancer research. OBJECTIVE: The objective of the study was to develop and evaluate an ontological approach to represent the temporal aspects of cancer study CDEs. METHODS: We used CDEs recorded in the National Cancer Institute (NCI) Cancer Data Standards Repository (caDSR) and created a CDE parser to extract time-relevant CDEs from the caDSR. Using the Web Ontology Language (OWL)–based Time Event Ontology (TEO), we manually derived representative patterns to semantically model the temporal components of the CDEs using an observing set of randomly selected time-related CDEs (n=600) to create a set of TEO ontological representation patterns. In evaluating TEO’s ability to represent the temporal components of the CDEs, this set of representation patterns was tested against two test sets of randomly selected time-related CDEs (n=425). RESULTS: It was found that 94.2% (801/850) of the CDEs in the test sets could be represented by the TEO representation patterns. CONCLUSIONS: In conclusion, TEO is a good ontological model for representing the temporal components of the CDEs recorded in caDSR. Our representative model can harness the Semantic Web reasoning and inferencing functionalities and present a means for temporal CDEs to be machine-readable, streamlining meaningful searches.
format Online
Article
Text
id pubmed-5843793
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-58437932018-03-19 Representation of Time-Relevant Common Data Elements in the Cancer Data Standards Repository: Statistical Evaluation of an Ontological Approach Chen, Henry W Du, Jingcheng Song, Hsing-Yi Liu, Xiangyu Jiang, Guoqian Tao, Cui JMIR Med Inform Original Paper BACKGROUND: Today, there is an increasing need to centralize and standardize electronic health data within clinical research as the volume of data continues to balloon. Domain-specific common data elements (CDEs) are emerging as a standard approach to clinical research data capturing and reporting. Recent efforts to standardize clinical study CDEs have been of great benefit in facilitating data integration and data sharing. The importance of the temporal dimension of clinical research studies has been well recognized; however, very few studies have focused on the formal representation of temporal constraints and temporal relationships within clinical research data in the biomedical research community. In particular, temporal information can be extremely powerful to enable high-quality cancer research. OBJECTIVE: The objective of the study was to develop and evaluate an ontological approach to represent the temporal aspects of cancer study CDEs. METHODS: We used CDEs recorded in the National Cancer Institute (NCI) Cancer Data Standards Repository (caDSR) and created a CDE parser to extract time-relevant CDEs from the caDSR. Using the Web Ontology Language (OWL)–based Time Event Ontology (TEO), we manually derived representative patterns to semantically model the temporal components of the CDEs using an observing set of randomly selected time-related CDEs (n=600) to create a set of TEO ontological representation patterns. In evaluating TEO’s ability to represent the temporal components of the CDEs, this set of representation patterns was tested against two test sets of randomly selected time-related CDEs (n=425). RESULTS: It was found that 94.2% (801/850) of the CDEs in the test sets could be represented by the TEO representation patterns. CONCLUSIONS: In conclusion, TEO is a good ontological model for representing the temporal components of the CDEs recorded in caDSR. Our representative model can harness the Semantic Web reasoning and inferencing functionalities and present a means for temporal CDEs to be machine-readable, streamlining meaningful searches. JMIR Publications 2018-02-22 /pmc/articles/PMC5843793/ /pubmed/29472179 http://dx.doi.org/10.2196/medinform.8175 Text en ©Henry W Chen, Jingcheng Du, Hsing-Yi Song, Xiangyu Liu, Guoqian Jiang, Cui Tao. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 22.02.2018. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Chen, Henry W
Du, Jingcheng
Song, Hsing-Yi
Liu, Xiangyu
Jiang, Guoqian
Tao, Cui
Representation of Time-Relevant Common Data Elements in the Cancer Data Standards Repository: Statistical Evaluation of an Ontological Approach
title Representation of Time-Relevant Common Data Elements in the Cancer Data Standards Repository: Statistical Evaluation of an Ontological Approach
title_full Representation of Time-Relevant Common Data Elements in the Cancer Data Standards Repository: Statistical Evaluation of an Ontological Approach
title_fullStr Representation of Time-Relevant Common Data Elements in the Cancer Data Standards Repository: Statistical Evaluation of an Ontological Approach
title_full_unstemmed Representation of Time-Relevant Common Data Elements in the Cancer Data Standards Repository: Statistical Evaluation of an Ontological Approach
title_short Representation of Time-Relevant Common Data Elements in the Cancer Data Standards Repository: Statistical Evaluation of an Ontological Approach
title_sort representation of time-relevant common data elements in the cancer data standards repository: statistical evaluation of an ontological approach
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5843793/
https://www.ncbi.nlm.nih.gov/pubmed/29472179
http://dx.doi.org/10.2196/medinform.8175
work_keys_str_mv AT chenhenryw representationoftimerelevantcommondataelementsinthecancerdatastandardsrepositorystatisticalevaluationofanontologicalapproach
AT dujingcheng representationoftimerelevantcommondataelementsinthecancerdatastandardsrepositorystatisticalevaluationofanontologicalapproach
AT songhsingyi representationoftimerelevantcommondataelementsinthecancerdatastandardsrepositorystatisticalevaluationofanontologicalapproach
AT liuxiangyu representationoftimerelevantcommondataelementsinthecancerdatastandardsrepositorystatisticalevaluationofanontologicalapproach
AT jiangguoqian representationoftimerelevantcommondataelementsinthecancerdatastandardsrepositorystatisticalevaluationofanontologicalapproach
AT taocui representationoftimerelevantcommondataelementsinthecancerdatastandardsrepositorystatisticalevaluationofanontologicalapproach