Cargando…

Automatic detection and resolution of measurement-unit conflicts in aggregated data

BACKGROUND: Measurement-unit conflicts are a perennial problem in integrative research domains such as clinical meta-analysis. As multi-national collaborations grow, as new measurement instruments appear, and as Linked Open Data infrastructures become increasingly pervasive, the number of such confl...

Descripción completa

Detalles Bibliográficos
Autores principales: Samadian, Soroush, McManus, Bruce, Wilkinson, Mark
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4101427/
https://www.ncbi.nlm.nih.gov/pubmed/25079396
http://dx.doi.org/10.1186/1755-8794-7-S1-S12
_version_ 1782480895908773888
author Samadian, Soroush
McManus, Bruce
Wilkinson, Mark
author_facet Samadian, Soroush
McManus, Bruce
Wilkinson, Mark
author_sort Samadian, Soroush
collection PubMed
description BACKGROUND: Measurement-unit conflicts are a perennial problem in integrative research domains such as clinical meta-analysis. As multi-national collaborations grow, as new measurement instruments appear, and as Linked Open Data infrastructures become increasingly pervasive, the number of such conflicts will similarly increase. METHODS: We propose a generic approach to the problem of (a) encoding measurement units in datasets in a machine-readable manner, (b) detecting when a dataset contained mixtures of measurement units, and (c) automatically converting any conflicting units into a desired unit, as defined for a given study. RESULTS: We utilized existing ontologies and standards for scientific data representation, measurement unit definition, and data manipulation to build a simple and flexible Semantic Web Service-based approach to measurement-unit harmonization. A cardiovascular patient cohort in which clinical measurements were recorded in a number of different units (e.g., mmHg and cmHg for blood pressure) was automatically classified into a number of clinical phenotypes, semantically defined using different measurement units. CONCLUSIONS: We demonstrate that through a combination of semantic standards and frameworks, unit integration problems can be automatically detected and resolved.
format Online
Article
Text
id pubmed-4101427
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41014272014-07-18 Automatic detection and resolution of measurement-unit conflicts in aggregated data Samadian, Soroush McManus, Bruce Wilkinson, Mark BMC Med Genomics Research BACKGROUND: Measurement-unit conflicts are a perennial problem in integrative research domains such as clinical meta-analysis. As multi-national collaborations grow, as new measurement instruments appear, and as Linked Open Data infrastructures become increasingly pervasive, the number of such conflicts will similarly increase. METHODS: We propose a generic approach to the problem of (a) encoding measurement units in datasets in a machine-readable manner, (b) detecting when a dataset contained mixtures of measurement units, and (c) automatically converting any conflicting units into a desired unit, as defined for a given study. RESULTS: We utilized existing ontologies and standards for scientific data representation, measurement unit definition, and data manipulation to build a simple and flexible Semantic Web Service-based approach to measurement-unit harmonization. A cardiovascular patient cohort in which clinical measurements were recorded in a number of different units (e.g., mmHg and cmHg for blood pressure) was automatically classified into a number of clinical phenotypes, semantically defined using different measurement units. CONCLUSIONS: We demonstrate that through a combination of semantic standards and frameworks, unit integration problems can be automatically detected and resolved. BioMed Central 2014-05-08 /pmc/articles/PMC4101427/ /pubmed/25079396 http://dx.doi.org/10.1186/1755-8794-7-S1-S12 Text en Copyright © 2014 Samadian et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Samadian, Soroush
McManus, Bruce
Wilkinson, Mark
Automatic detection and resolution of measurement-unit conflicts in aggregated data
title Automatic detection and resolution of measurement-unit conflicts in aggregated data
title_full Automatic detection and resolution of measurement-unit conflicts in aggregated data
title_fullStr Automatic detection and resolution of measurement-unit conflicts in aggregated data
title_full_unstemmed Automatic detection and resolution of measurement-unit conflicts in aggregated data
title_short Automatic detection and resolution of measurement-unit conflicts in aggregated data
title_sort automatic detection and resolution of measurement-unit conflicts in aggregated data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4101427/
https://www.ncbi.nlm.nih.gov/pubmed/25079396
http://dx.doi.org/10.1186/1755-8794-7-S1-S12
work_keys_str_mv AT samadiansoroush automaticdetectionandresolutionofmeasurementunitconflictsinaggregateddata
AT mcmanusbruce automaticdetectionandresolutionofmeasurementunitconflictsinaggregateddata
AT wilkinsonmark automaticdetectionandresolutionofmeasurementunitconflictsinaggregateddata