Cargando…

Assessing the functional coherence of gene sets with metrics based on the Gene Ontology graph

Motivation: The results of initial analyses for many high-throughput technologies commonly take the form of gene or protein sets, and one of the ensuing tasks is to evaluate the functional coherence of these sets. The study of gene set function most commonly makes use of controlled vocabulary in the...

Descripción completa

Detalles Bibliográficos
Autores principales: Richards, Adam J., Muller, Brian, Shotwell, Matthew, Cowart, L. Ashley, Rohrer, Bäerbel, Lu, Xinghua
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881388/
https://www.ncbi.nlm.nih.gov/pubmed/20529941
http://dx.doi.org/10.1093/bioinformatics/btq203
_version_ 1782182112782188544
author Richards, Adam J.
Muller, Brian
Shotwell, Matthew
Cowart, L. Ashley
Rohrer, Bäerbel
Lu, Xinghua
author_facet Richards, Adam J.
Muller, Brian
Shotwell, Matthew
Cowart, L. Ashley
Rohrer, Bäerbel
Lu, Xinghua
author_sort Richards, Adam J.
collection PubMed
description Motivation: The results of initial analyses for many high-throughput technologies commonly take the form of gene or protein sets, and one of the ensuing tasks is to evaluate the functional coherence of these sets. The study of gene set function most commonly makes use of controlled vocabulary in the form of ontology annotations. For a given gene set, the statistical significance of observing these annotations or ‘enrichment’ may be tested using a number of methods. Instead of testing for significance of individual terms, this study is concerned with the task of assessing the global functional coherence of gene sets, for which novel metrics and statistical methods have been devised. Results: The metrics of this study are based on the topological properties of graphs comprised of genes and their Gene Ontology annotations. A novel aspect of these methods is that both the enrichment of annotations and the relationships among annotations are considered when determining the significance of functional coherence. We applied our methods to perform analyses on an existing database and on microarray experimental results. Here, we demonstrated that our approach is highly discriminative in terms of differentiating coherent gene sets from random ones and that it provides biologically sensible evaluations in microarray analysis. We further used examples to show the utility of graph visualization as a tool for studying the functional coherence of gene sets. Availability: The implementation is provided as a freely accessible web application at: http://projects.dbbe.musc.edu/gosteiner. Additionally, the source code written in the Python programming language, is available under the General Public License of the Free Software Foundation. Contact: lux@musc.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-2881388
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28813882010-06-08 Assessing the functional coherence of gene sets with metrics based on the Gene Ontology graph Richards, Adam J. Muller, Brian Shotwell, Matthew Cowart, L. Ashley Rohrer, Bäerbel Lu, Xinghua Bioinformatics Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa Motivation: The results of initial analyses for many high-throughput technologies commonly take the form of gene or protein sets, and one of the ensuing tasks is to evaluate the functional coherence of these sets. The study of gene set function most commonly makes use of controlled vocabulary in the form of ontology annotations. For a given gene set, the statistical significance of observing these annotations or ‘enrichment’ may be tested using a number of methods. Instead of testing for significance of individual terms, this study is concerned with the task of assessing the global functional coherence of gene sets, for which novel metrics and statistical methods have been devised. Results: The metrics of this study are based on the topological properties of graphs comprised of genes and their Gene Ontology annotations. A novel aspect of these methods is that both the enrichment of annotations and the relationships among annotations are considered when determining the significance of functional coherence. We applied our methods to perform analyses on an existing database and on microarray experimental results. Here, we demonstrated that our approach is highly discriminative in terms of differentiating coherent gene sets from random ones and that it provides biologically sensible evaluations in microarray analysis. We further used examples to show the utility of graph visualization as a tool for studying the functional coherence of gene sets. Availability: The implementation is provided as a freely accessible web application at: http://projects.dbbe.musc.edu/gosteiner. Additionally, the source code written in the Python programming language, is available under the General Public License of the Free Software Foundation. Contact: lux@musc.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2010-06-15 2010-06-01 /pmc/articles/PMC2881388/ /pubmed/20529941 http://dx.doi.org/10.1093/bioinformatics/btq203 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa
Richards, Adam J.
Muller, Brian
Shotwell, Matthew
Cowart, L. Ashley
Rohrer, Bäerbel
Lu, Xinghua
Assessing the functional coherence of gene sets with metrics based on the Gene Ontology graph
title Assessing the functional coherence of gene sets with metrics based on the Gene Ontology graph
title_full Assessing the functional coherence of gene sets with metrics based on the Gene Ontology graph
title_fullStr Assessing the functional coherence of gene sets with metrics based on the Gene Ontology graph
title_full_unstemmed Assessing the functional coherence of gene sets with metrics based on the Gene Ontology graph
title_short Assessing the functional coherence of gene sets with metrics based on the Gene Ontology graph
title_sort assessing the functional coherence of gene sets with metrics based on the gene ontology graph
topic Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881388/
https://www.ncbi.nlm.nih.gov/pubmed/20529941
http://dx.doi.org/10.1093/bioinformatics/btq203
work_keys_str_mv AT richardsadamj assessingthefunctionalcoherenceofgenesetswithmetricsbasedonthegeneontologygraph
AT mullerbrian assessingthefunctionalcoherenceofgenesetswithmetricsbasedonthegeneontologygraph
AT shotwellmatthew assessingthefunctionalcoherenceofgenesetswithmetricsbasedonthegeneontologygraph
AT cowartlashley assessingthefunctionalcoherenceofgenesetswithmetricsbasedonthegeneontologygraph
AT rohrerbaerbel assessingthefunctionalcoherenceofgenesetswithmetricsbasedonthegeneontologygraph
AT luxinghua assessingthefunctionalcoherenceofgenesetswithmetricsbasedonthegeneontologygraph