Cargando…

Markov Chain Ontology Analysis (MCOA)

BACKGROUND: Biomedical ontologies have become an increasingly critical lens through which researchers analyze the genomic, clinical and bibliographic data that fuels scientific research. Of particular relevance are methods, such as enrichment analysis, that quantify the importance of ontology classe...

Descripción completa

Detalles Bibliográficos
Autores principales: Frost, H Robert, McCray, Alexa T
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3329418/
https://www.ncbi.nlm.nih.gov/pubmed/22300537
http://dx.doi.org/10.1186/1471-2105-13-23
_version_ 1782229835486068736
author Frost, H Robert
McCray, Alexa T
author_facet Frost, H Robert
McCray, Alexa T
author_sort Frost, H Robert
collection PubMed
description BACKGROUND: Biomedical ontologies have become an increasingly critical lens through which researchers analyze the genomic, clinical and bibliographic data that fuels scientific research. Of particular relevance are methods, such as enrichment analysis, that quantify the importance of ontology classes relative to a collection of domain data. Current analytical techniques, however, remain limited in their ability to handle many important types of structural complexity encountered in real biological systems including class overlaps, continuously valued data, inter-instance relationships, non-hierarchical relationships between classes, semantic distance and sparse data. RESULTS: In this paper, we describe a methodology called Markov Chain Ontology Analysis (MCOA) and illustrate its use through a MCOA-based enrichment analysis application based on a generative model of gene activation. MCOA models the classes in an ontology, the instances from an associated dataset and all directional inter-class, class-to-instance and inter-instance relationships as a single finite ergodic Markov chain. The adjusted transition probability matrix for this Markov chain enables the calculation of eigenvector values that quantify the importance of each ontology class relative to other classes and the associated data set members. On both controlled Gene Ontology (GO) data sets created with Escherichia coli, Drosophila melanogaster and Homo sapiens annotations and real gene expression data extracted from the Gene Expression Omnibus (GEO), the MCOA enrichment analysis approach provides the best performance of comparable state-of-the-art methods. CONCLUSION: A methodology based on Markov chain models and network analytic metrics can help detect the relevant signal within large, highly interdependent and noisy data sets and, for applications such as enrichment analysis, has been shown to generate superior performance on both real and simulated data relative to existing state-of-the-art approaches.
format Online
Article
Text
id pubmed-3329418
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33294182012-04-23 Markov Chain Ontology Analysis (MCOA) Frost, H Robert McCray, Alexa T BMC Bioinformatics Methodology Article BACKGROUND: Biomedical ontologies have become an increasingly critical lens through which researchers analyze the genomic, clinical and bibliographic data that fuels scientific research. Of particular relevance are methods, such as enrichment analysis, that quantify the importance of ontology classes relative to a collection of domain data. Current analytical techniques, however, remain limited in their ability to handle many important types of structural complexity encountered in real biological systems including class overlaps, continuously valued data, inter-instance relationships, non-hierarchical relationships between classes, semantic distance and sparse data. RESULTS: In this paper, we describe a methodology called Markov Chain Ontology Analysis (MCOA) and illustrate its use through a MCOA-based enrichment analysis application based on a generative model of gene activation. MCOA models the classes in an ontology, the instances from an associated dataset and all directional inter-class, class-to-instance and inter-instance relationships as a single finite ergodic Markov chain. The adjusted transition probability matrix for this Markov chain enables the calculation of eigenvector values that quantify the importance of each ontology class relative to other classes and the associated data set members. On both controlled Gene Ontology (GO) data sets created with Escherichia coli, Drosophila melanogaster and Homo sapiens annotations and real gene expression data extracted from the Gene Expression Omnibus (GEO), the MCOA enrichment analysis approach provides the best performance of comparable state-of-the-art methods. CONCLUSION: A methodology based on Markov chain models and network analytic metrics can help detect the relevant signal within large, highly interdependent and noisy data sets and, for applications such as enrichment analysis, has been shown to generate superior performance on both real and simulated data relative to existing state-of-the-art approaches. BioMed Central 2012-02-03 /pmc/articles/PMC3329418/ /pubmed/22300537 http://dx.doi.org/10.1186/1471-2105-13-23 Text en Copyright ©2012 Frost and McCray; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Frost, H Robert
McCray, Alexa T
Markov Chain Ontology Analysis (MCOA)
title Markov Chain Ontology Analysis (MCOA)
title_full Markov Chain Ontology Analysis (MCOA)
title_fullStr Markov Chain Ontology Analysis (MCOA)
title_full_unstemmed Markov Chain Ontology Analysis (MCOA)
title_short Markov Chain Ontology Analysis (MCOA)
title_sort markov chain ontology analysis (mcoa)
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3329418/
https://www.ncbi.nlm.nih.gov/pubmed/22300537
http://dx.doi.org/10.1186/1471-2105-13-23
work_keys_str_mv AT frosthrobert markovchainontologyanalysismcoa
AT mccrayalexat markovchainontologyanalysismcoa