Cargando…

ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository

BACKGROUND: The volume and complexity of patient data – especially in personalised medicine – is steadily increasing, both regarding clinical data and genomic profiles: Typically more than 1,000 items (e.g., laboratory values, vital signs, diagnostic tests etc.) are collected per patient in clinical...

Descripción completa

Detalles Bibliográficos
Autores principales: Dugas, Martin, Meidt, Alexandra, Neuhaus, Philipp, Storck, Michael, Varghese, Julian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4888420/
https://www.ncbi.nlm.nih.gov/pubmed/27245222
http://dx.doi.org/10.1186/s12874-016-0164-9
_version_ 1782434849691271168
author Dugas, Martin
Meidt, Alexandra
Neuhaus, Philipp
Storck, Michael
Varghese, Julian
author_facet Dugas, Martin
Meidt, Alexandra
Neuhaus, Philipp
Storck, Michael
Varghese, Julian
author_sort Dugas, Martin
collection PubMed
description BACKGROUND: The volume and complexity of patient data – especially in personalised medicine – is steadily increasing, both regarding clinical data and genomic profiles: Typically more than 1,000 items (e.g., laboratory values, vital signs, diagnostic tests etc.) are collected per patient in clinical trials. In oncology hundreds of mutations can potentially be detected for each patient by genomic profiling. Therefore data integration from multiple sources constitutes a key challenge for medical research and healthcare. METHODS: Semantic annotation of data elements can facilitate to identify matching data elements in different sources and thereby supports data integration. Millions of different annotations are required due to the semantic richness of patient data. These annotations should be uniform, i.e., two matching data elements shall contain the same annotations. However, large terminologies like SNOMED CT or UMLS don’t provide uniform coding. It is proposed to develop semantic annotations of medical data elements based on a large-scale public metadata repository. To achieve uniform codes, semantic annotations shall be re-used if a matching data element is available in the metadata repository. RESULTS: A web-based tool called ODMedit (https://odmeditor.uni-muenster.de/) was developed to create data models with uniform semantic annotations. It contains ~800,000 terms with semantic annotations which were derived from ~5,800 models from the portal of medical data models (MDM). The tool was successfully applied to manually annotate 22 forms with 292 data items from CDISC and to update 1,495 data models of the MDM portal. CONCLUSION: Uniform manual semantic annotation of data models is feasible in principle, but requires a large-scale collaborative effort due to the semantic richness of patient data. A web-based tool for these annotations is available, which is linked to a public metadata repository.
format Online
Article
Text
id pubmed-4888420
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48884202016-06-02 ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository Dugas, Martin Meidt, Alexandra Neuhaus, Philipp Storck, Michael Varghese, Julian BMC Med Res Methodol Research Article BACKGROUND: The volume and complexity of patient data – especially in personalised medicine – is steadily increasing, both regarding clinical data and genomic profiles: Typically more than 1,000 items (e.g., laboratory values, vital signs, diagnostic tests etc.) are collected per patient in clinical trials. In oncology hundreds of mutations can potentially be detected for each patient by genomic profiling. Therefore data integration from multiple sources constitutes a key challenge for medical research and healthcare. METHODS: Semantic annotation of data elements can facilitate to identify matching data elements in different sources and thereby supports data integration. Millions of different annotations are required due to the semantic richness of patient data. These annotations should be uniform, i.e., two matching data elements shall contain the same annotations. However, large terminologies like SNOMED CT or UMLS don’t provide uniform coding. It is proposed to develop semantic annotations of medical data elements based on a large-scale public metadata repository. To achieve uniform codes, semantic annotations shall be re-used if a matching data element is available in the metadata repository. RESULTS: A web-based tool called ODMedit (https://odmeditor.uni-muenster.de/) was developed to create data models with uniform semantic annotations. It contains ~800,000 terms with semantic annotations which were derived from ~5,800 models from the portal of medical data models (MDM). The tool was successfully applied to manually annotate 22 forms with 292 data items from CDISC and to update 1,495 data models of the MDM portal. CONCLUSION: Uniform manual semantic annotation of data models is feasible in principle, but requires a large-scale collaborative effort due to the semantic richness of patient data. A web-based tool for these annotations is available, which is linked to a public metadata repository. BioMed Central 2016-06-01 /pmc/articles/PMC4888420/ /pubmed/27245222 http://dx.doi.org/10.1186/s12874-016-0164-9 Text en © Dugas et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Dugas, Martin
Meidt, Alexandra
Neuhaus, Philipp
Storck, Michael
Varghese, Julian
ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository
title ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository
title_full ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository
title_fullStr ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository
title_full_unstemmed ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository
title_short ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository
title_sort odmedit: uniform semantic annotation for data integration in medicine based on a public metadata repository
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4888420/
https://www.ncbi.nlm.nih.gov/pubmed/27245222
http://dx.doi.org/10.1186/s12874-016-0164-9
work_keys_str_mv AT dugasmartin odmedituniformsemanticannotationfordataintegrationinmedicinebasedonapublicmetadatarepository
AT meidtalexandra odmedituniformsemanticannotationfordataintegrationinmedicinebasedonapublicmetadatarepository
AT neuhausphilipp odmedituniformsemanticannotationfordataintegrationinmedicinebasedonapublicmetadatarepository
AT storckmichael odmedituniformsemanticannotationfordataintegrationinmedicinebasedonapublicmetadatarepository
AT varghesejulian odmedituniformsemanticannotationfordataintegrationinmedicinebasedonapublicmetadatarepository