Cargando…
ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository
BACKGROUND: The volume and complexity of patient data – especially in personalised medicine – is steadily increasing, both regarding clinical data and genomic profiles: Typically more than 1,000 items (e.g., laboratory values, vital signs, diagnostic tests etc.) are collected per patient in clinical...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4888420/ https://www.ncbi.nlm.nih.gov/pubmed/27245222 http://dx.doi.org/10.1186/s12874-016-0164-9 |
_version_ | 1782434849691271168 |
---|---|
author | Dugas, Martin Meidt, Alexandra Neuhaus, Philipp Storck, Michael Varghese, Julian |
author_facet | Dugas, Martin Meidt, Alexandra Neuhaus, Philipp Storck, Michael Varghese, Julian |
author_sort | Dugas, Martin |
collection | PubMed |
description | BACKGROUND: The volume and complexity of patient data – especially in personalised medicine – is steadily increasing, both regarding clinical data and genomic profiles: Typically more than 1,000 items (e.g., laboratory values, vital signs, diagnostic tests etc.) are collected per patient in clinical trials. In oncology hundreds of mutations can potentially be detected for each patient by genomic profiling. Therefore data integration from multiple sources constitutes a key challenge for medical research and healthcare. METHODS: Semantic annotation of data elements can facilitate to identify matching data elements in different sources and thereby supports data integration. Millions of different annotations are required due to the semantic richness of patient data. These annotations should be uniform, i.e., two matching data elements shall contain the same annotations. However, large terminologies like SNOMED CT or UMLS don’t provide uniform coding. It is proposed to develop semantic annotations of medical data elements based on a large-scale public metadata repository. To achieve uniform codes, semantic annotations shall be re-used if a matching data element is available in the metadata repository. RESULTS: A web-based tool called ODMedit (https://odmeditor.uni-muenster.de/) was developed to create data models with uniform semantic annotations. It contains ~800,000 terms with semantic annotations which were derived from ~5,800 models from the portal of medical data models (MDM). The tool was successfully applied to manually annotate 22 forms with 292 data items from CDISC and to update 1,495 data models of the MDM portal. CONCLUSION: Uniform manual semantic annotation of data models is feasible in principle, but requires a large-scale collaborative effort due to the semantic richness of patient data. A web-based tool for these annotations is available, which is linked to a public metadata repository. |
format | Online Article Text |
id | pubmed-4888420 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-48884202016-06-02 ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository Dugas, Martin Meidt, Alexandra Neuhaus, Philipp Storck, Michael Varghese, Julian BMC Med Res Methodol Research Article BACKGROUND: The volume and complexity of patient data – especially in personalised medicine – is steadily increasing, both regarding clinical data and genomic profiles: Typically more than 1,000 items (e.g., laboratory values, vital signs, diagnostic tests etc.) are collected per patient in clinical trials. In oncology hundreds of mutations can potentially be detected for each patient by genomic profiling. Therefore data integration from multiple sources constitutes a key challenge for medical research and healthcare. METHODS: Semantic annotation of data elements can facilitate to identify matching data elements in different sources and thereby supports data integration. Millions of different annotations are required due to the semantic richness of patient data. These annotations should be uniform, i.e., two matching data elements shall contain the same annotations. However, large terminologies like SNOMED CT or UMLS don’t provide uniform coding. It is proposed to develop semantic annotations of medical data elements based on a large-scale public metadata repository. To achieve uniform codes, semantic annotations shall be re-used if a matching data element is available in the metadata repository. RESULTS: A web-based tool called ODMedit (https://odmeditor.uni-muenster.de/) was developed to create data models with uniform semantic annotations. It contains ~800,000 terms with semantic annotations which were derived from ~5,800 models from the portal of medical data models (MDM). The tool was successfully applied to manually annotate 22 forms with 292 data items from CDISC and to update 1,495 data models of the MDM portal. CONCLUSION: Uniform manual semantic annotation of data models is feasible in principle, but requires a large-scale collaborative effort due to the semantic richness of patient data. A web-based tool for these annotations is available, which is linked to a public metadata repository. BioMed Central 2016-06-01 /pmc/articles/PMC4888420/ /pubmed/27245222 http://dx.doi.org/10.1186/s12874-016-0164-9 Text en © Dugas et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Dugas, Martin Meidt, Alexandra Neuhaus, Philipp Storck, Michael Varghese, Julian ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository |
title | ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository |
title_full | ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository |
title_fullStr | ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository |
title_full_unstemmed | ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository |
title_short | ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository |
title_sort | odmedit: uniform semantic annotation for data integration in medicine based on a public metadata repository |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4888420/ https://www.ncbi.nlm.nih.gov/pubmed/27245222 http://dx.doi.org/10.1186/s12874-016-0164-9 |
work_keys_str_mv | AT dugasmartin odmedituniformsemanticannotationfordataintegrationinmedicinebasedonapublicmetadatarepository AT meidtalexandra odmedituniformsemanticannotationfordataintegrationinmedicinebasedonapublicmetadatarepository AT neuhausphilipp odmedituniformsemanticannotationfordataintegrationinmedicinebasedonapublicmetadatarepository AT storckmichael odmedituniformsemanticannotationfordataintegrationinmedicinebasedonapublicmetadatarepository AT varghesejulian odmedituniformsemanticannotationfordataintegrationinmedicinebasedonapublicmetadatarepository |