Cargando…

Metadata mapping and reuse in caBIG™

BACKGROUND: This paper proposes that interoperability across biomedical databases can be improved by utilizing a repository of Common Data Elements (CDEs), UML model class-attributes and simple lexical algorithms to facilitate the building domain models. This is examined in the context of an existin...

Descripción completa

Detalles Bibliográficos
Autores principales: Kunz, Isaac, Lin, Ming-Chin, Frey, Lewis
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2646244/
https://www.ncbi.nlm.nih.gov/pubmed/19208192
http://dx.doi.org/10.1186/1471-2105-10-S2-S4
_version_ 1782164831134023680
author Kunz, Isaac
Lin, Ming-Chin
Frey, Lewis
author_facet Kunz, Isaac
Lin, Ming-Chin
Frey, Lewis
author_sort Kunz, Isaac
collection PubMed
description BACKGROUND: This paper proposes that interoperability across biomedical databases can be improved by utilizing a repository of Common Data Elements (CDEs), UML model class-attributes and simple lexical algorithms to facilitate the building domain models. This is examined in the context of an existing system, the National Cancer Institute (NCI)'s cancer Biomedical Informatics Grid (caBIG™). The goal is to demonstrate the deployment of open source tools that can be used to effectively map models and enable the reuse of existing information objects and CDEs in the development of new models for translational research applications. This effort is intended to help developers reuse appropriate CDEs to enable interoperability of their systems when developing within the caBIG™ framework or other frameworks that use metadata repositories. RESULTS: The Dice (di-grams) and Dynamic algorithms are compared and both algorithms have similar performance matching UML model class-attributes to CDE class object-property pairs. With algorithms used, the baselines for automatically finding the matches are reasonable for the data models examined. It suggests that automatic mapping of UML models and CDEs is feasible within the caBIG™ framework and potentially any framework that uses a metadata repository. CONCLUSION: This work opens up the possibility of using mapping algorithms to reduce cost and time required to map local data models to a reference data model such as those used within caBIG™. This effort contributes to facilitating the development of interoperable systems within caBIG™ as well as other metadata frameworks. Such efforts are critical to address the need to develop systems to handle enormous amounts of diverse data that can be leveraged from new biomedical methodologies.
format Text
id pubmed-2646244
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26462442009-02-23 Metadata mapping and reuse in caBIG™ Kunz, Isaac Lin, Ming-Chin Frey, Lewis BMC Bioinformatics Proceedings BACKGROUND: This paper proposes that interoperability across biomedical databases can be improved by utilizing a repository of Common Data Elements (CDEs), UML model class-attributes and simple lexical algorithms to facilitate the building domain models. This is examined in the context of an existing system, the National Cancer Institute (NCI)'s cancer Biomedical Informatics Grid (caBIG™). The goal is to demonstrate the deployment of open source tools that can be used to effectively map models and enable the reuse of existing information objects and CDEs in the development of new models for translational research applications. This effort is intended to help developers reuse appropriate CDEs to enable interoperability of their systems when developing within the caBIG™ framework or other frameworks that use metadata repositories. RESULTS: The Dice (di-grams) and Dynamic algorithms are compared and both algorithms have similar performance matching UML model class-attributes to CDE class object-property pairs. With algorithms used, the baselines for automatically finding the matches are reasonable for the data models examined. It suggests that automatic mapping of UML models and CDEs is feasible within the caBIG™ framework and potentially any framework that uses a metadata repository. CONCLUSION: This work opens up the possibility of using mapping algorithms to reduce cost and time required to map local data models to a reference data model such as those used within caBIG™. This effort contributes to facilitating the development of interoperable systems within caBIG™ as well as other metadata frameworks. Such efforts are critical to address the need to develop systems to handle enormous amounts of diverse data that can be leveraged from new biomedical methodologies. BioMed Central 2009-02-05 /pmc/articles/PMC2646244/ /pubmed/19208192 http://dx.doi.org/10.1186/1471-2105-10-S2-S4 Text en Copyright © 2009 Kunz et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Kunz, Isaac
Lin, Ming-Chin
Frey, Lewis
Metadata mapping and reuse in caBIG™
title Metadata mapping and reuse in caBIG™
title_full Metadata mapping and reuse in caBIG™
title_fullStr Metadata mapping and reuse in caBIG™
title_full_unstemmed Metadata mapping and reuse in caBIG™
title_short Metadata mapping and reuse in caBIG™
title_sort metadata mapping and reuse in cabig™
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2646244/
https://www.ncbi.nlm.nih.gov/pubmed/19208192
http://dx.doi.org/10.1186/1471-2105-10-S2-S4
work_keys_str_mv AT kunzisaac metadatamappingandreuseincabig
AT linmingchin metadatamappingandreuseincabig
AT freylewis metadatamappingandreuseincabig