Cargando…

CODA-ML: context-specific biological knowledge representation for systemic physiology analysis

BACKGROUND: Computational analysis of complex diseases involving multiple organs requires the integration of multiple different models into a unified model. Different models are often constructed in heterogeneous formats. Thus, the integration of the models requires a standard language format that c...

Descripción completa

Detalles Bibliográficos
Autores principales: Kwon, Mijin, Yim, Soorin, Kim, Gwangmin, Lee, Saehwan, Jeong, Chungsun, Lee, Doheon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6538558/
https://www.ncbi.nlm.nih.gov/pubmed/31138123
http://dx.doi.org/10.1186/s12859-019-2812-7
_version_ 1783422188231065600
author Kwon, Mijin
Yim, Soorin
Kim, Gwangmin
Lee, Saehwan
Jeong, Chungsun
Lee, Doheon
author_facet Kwon, Mijin
Yim, Soorin
Kim, Gwangmin
Lee, Saehwan
Jeong, Chungsun
Lee, Doheon
author_sort Kwon, Mijin
collection PubMed
description BACKGROUND: Computational analysis of complex diseases involving multiple organs requires the integration of multiple different models into a unified model. Different models are often constructed in heterogeneous formats. Thus, the integration of the models requires a standard language format that can effectively represent essential biological information. However, the previously introduced formats have limitations that prevent from adequately representing essential biological information, particularly specifications of bio-molecules and biological contexts. RESULTS: We defined an XML-based markup language called context-oriented directed association markup language (CODA-ML), which better represents essential biological information. The CODA-ML has two major strengths in designating molecular specifications and biological contexts. It can cover heterogeneous entity types involved in biological events (e.g. gene/protein, compound, cellular function, disease). Molecular types of entities can have molecular specifications which include detailed information of a molecule from isoforms to modifications, enabling high-resolution representation of molecules. In addition, it can distinguish biological events that vary depending on different biological contexts such as cell types or disease conditions. Especially representation of inter-cellular events as well as intra-cellular events is available. These two major strengths can resolve contradictory associations when different models are integrated into one unified model, which improves the accuracy of the model. CONCLUSIONS: With the CODA-ML, diverse models such as signaling pathways, metabolic pathways, and gene regulatory pathways can be represented in a unified language format. Heterogeneous entity types can be covered by the CODA-ML, thus it enables detailed description for the mechanisms of diseases or drugs from multiple perspectives (e.g., molecule, function or disease). The CODA-ML is expected to help integrate different models into one systemic model in an efficient and effective. The unified model can be used to perform computational analysis not only for cancer but also for other complex diseases involving multiple organs beyond a single cell. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2812-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6538558
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-65385582019-06-03 CODA-ML: context-specific biological knowledge representation for systemic physiology analysis Kwon, Mijin Yim, Soorin Kim, Gwangmin Lee, Saehwan Jeong, Chungsun Lee, Doheon BMC Bioinformatics Research BACKGROUND: Computational analysis of complex diseases involving multiple organs requires the integration of multiple different models into a unified model. Different models are often constructed in heterogeneous formats. Thus, the integration of the models requires a standard language format that can effectively represent essential biological information. However, the previously introduced formats have limitations that prevent from adequately representing essential biological information, particularly specifications of bio-molecules and biological contexts. RESULTS: We defined an XML-based markup language called context-oriented directed association markup language (CODA-ML), which better represents essential biological information. The CODA-ML has two major strengths in designating molecular specifications and biological contexts. It can cover heterogeneous entity types involved in biological events (e.g. gene/protein, compound, cellular function, disease). Molecular types of entities can have molecular specifications which include detailed information of a molecule from isoforms to modifications, enabling high-resolution representation of molecules. In addition, it can distinguish biological events that vary depending on different biological contexts such as cell types or disease conditions. Especially representation of inter-cellular events as well as intra-cellular events is available. These two major strengths can resolve contradictory associations when different models are integrated into one unified model, which improves the accuracy of the model. CONCLUSIONS: With the CODA-ML, diverse models such as signaling pathways, metabolic pathways, and gene regulatory pathways can be represented in a unified language format. Heterogeneous entity types can be covered by the CODA-ML, thus it enables detailed description for the mechanisms of diseases or drugs from multiple perspectives (e.g., molecule, function or disease). The CODA-ML is expected to help integrate different models into one systemic model in an efficient and effective. The unified model can be used to perform computational analysis not only for cancer but also for other complex diseases involving multiple organs beyond a single cell. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2812-7) contains supplementary material, which is available to authorized users. BioMed Central 2019-05-29 /pmc/articles/PMC6538558/ /pubmed/31138123 http://dx.doi.org/10.1186/s12859-019-2812-7 Text en © The Author(s). 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Kwon, Mijin
Yim, Soorin
Kim, Gwangmin
Lee, Saehwan
Jeong, Chungsun
Lee, Doheon
CODA-ML: context-specific biological knowledge representation for systemic physiology analysis
title CODA-ML: context-specific biological knowledge representation for systemic physiology analysis
title_full CODA-ML: context-specific biological knowledge representation for systemic physiology analysis
title_fullStr CODA-ML: context-specific biological knowledge representation for systemic physiology analysis
title_full_unstemmed CODA-ML: context-specific biological knowledge representation for systemic physiology analysis
title_short CODA-ML: context-specific biological knowledge representation for systemic physiology analysis
title_sort coda-ml: context-specific biological knowledge representation for systemic physiology analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6538558/
https://www.ncbi.nlm.nih.gov/pubmed/31138123
http://dx.doi.org/10.1186/s12859-019-2812-7
work_keys_str_mv AT kwonmijin codamlcontextspecificbiologicalknowledgerepresentationforsystemicphysiologyanalysis
AT yimsoorin codamlcontextspecificbiologicalknowledgerepresentationforsystemicphysiologyanalysis
AT kimgwangmin codamlcontextspecificbiologicalknowledgerepresentationforsystemicphysiologyanalysis
AT leesaehwan codamlcontextspecificbiologicalknowledgerepresentationforsystemicphysiologyanalysis
AT jeongchungsun codamlcontextspecificbiologicalknowledgerepresentationforsystemicphysiologyanalysis
AT leedoheon codamlcontextspecificbiologicalknowledgerepresentationforsystemicphysiologyanalysis