Cargando…
Metadata mapping and reuse in caBIG™
BACKGROUND: This paper proposes that interoperability across biomedical databases can be improved by utilizing a repository of Common Data Elements (CDEs), UML model class-attributes and simple lexical algorithms to facilitate the building domain models. This is examined in the context of an existin...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2646244/ https://www.ncbi.nlm.nih.gov/pubmed/19208192 http://dx.doi.org/10.1186/1471-2105-10-S2-S4 |
_version_ | 1782164831134023680 |
---|---|
author | Kunz, Isaac Lin, Ming-Chin Frey, Lewis |
author_facet | Kunz, Isaac Lin, Ming-Chin Frey, Lewis |
author_sort | Kunz, Isaac |
collection | PubMed |
description | BACKGROUND: This paper proposes that interoperability across biomedical databases can be improved by utilizing a repository of Common Data Elements (CDEs), UML model class-attributes and simple lexical algorithms to facilitate the building domain models. This is examined in the context of an existing system, the National Cancer Institute (NCI)'s cancer Biomedical Informatics Grid (caBIG™). The goal is to demonstrate the deployment of open source tools that can be used to effectively map models and enable the reuse of existing information objects and CDEs in the development of new models for translational research applications. This effort is intended to help developers reuse appropriate CDEs to enable interoperability of their systems when developing within the caBIG™ framework or other frameworks that use metadata repositories. RESULTS: The Dice (di-grams) and Dynamic algorithms are compared and both algorithms have similar performance matching UML model class-attributes to CDE class object-property pairs. With algorithms used, the baselines for automatically finding the matches are reasonable for the data models examined. It suggests that automatic mapping of UML models and CDEs is feasible within the caBIG™ framework and potentially any framework that uses a metadata repository. CONCLUSION: This work opens up the possibility of using mapping algorithms to reduce cost and time required to map local data models to a reference data model such as those used within caBIG™. This effort contributes to facilitating the development of interoperable systems within caBIG™ as well as other metadata frameworks. Such efforts are critical to address the need to develop systems to handle enormous amounts of diverse data that can be leveraged from new biomedical methodologies. |
format | Text |
id | pubmed-2646244 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26462442009-02-23 Metadata mapping and reuse in caBIG™ Kunz, Isaac Lin, Ming-Chin Frey, Lewis BMC Bioinformatics Proceedings BACKGROUND: This paper proposes that interoperability across biomedical databases can be improved by utilizing a repository of Common Data Elements (CDEs), UML model class-attributes and simple lexical algorithms to facilitate the building domain models. This is examined in the context of an existing system, the National Cancer Institute (NCI)'s cancer Biomedical Informatics Grid (caBIG™). The goal is to demonstrate the deployment of open source tools that can be used to effectively map models and enable the reuse of existing information objects and CDEs in the development of new models for translational research applications. This effort is intended to help developers reuse appropriate CDEs to enable interoperability of their systems when developing within the caBIG™ framework or other frameworks that use metadata repositories. RESULTS: The Dice (di-grams) and Dynamic algorithms are compared and both algorithms have similar performance matching UML model class-attributes to CDE class object-property pairs. With algorithms used, the baselines for automatically finding the matches are reasonable for the data models examined. It suggests that automatic mapping of UML models and CDEs is feasible within the caBIG™ framework and potentially any framework that uses a metadata repository. CONCLUSION: This work opens up the possibility of using mapping algorithms to reduce cost and time required to map local data models to a reference data model such as those used within caBIG™. This effort contributes to facilitating the development of interoperable systems within caBIG™ as well as other metadata frameworks. Such efforts are critical to address the need to develop systems to handle enormous amounts of diverse data that can be leveraged from new biomedical methodologies. BioMed Central 2009-02-05 /pmc/articles/PMC2646244/ /pubmed/19208192 http://dx.doi.org/10.1186/1471-2105-10-S2-S4 Text en Copyright © 2009 Kunz et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Kunz, Isaac Lin, Ming-Chin Frey, Lewis Metadata mapping and reuse in caBIG™ |
title | Metadata mapping and reuse in caBIG™ |
title_full | Metadata mapping and reuse in caBIG™ |
title_fullStr | Metadata mapping and reuse in caBIG™ |
title_full_unstemmed | Metadata mapping and reuse in caBIG™ |
title_short | Metadata mapping and reuse in caBIG™ |
title_sort | metadata mapping and reuse in cabig™ |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2646244/ https://www.ncbi.nlm.nih.gov/pubmed/19208192 http://dx.doi.org/10.1186/1471-2105-10-S2-S4 |
work_keys_str_mv | AT kunzisaac metadatamappingandreuseincabig AT linmingchin metadatamappingandreuseincabig AT freylewis metadatamappingandreuseincabig |