Cargando…

Onto-CC: a web server for identifying Gene Ontology conceptual clusters

The Gene Ontology (GO) vocabulary has been extensively explored to analyze the functions of coexpressed genes. However, despite its extended use in Biology and Medical Sciences, there are still high levels of uncertainty about which ontology (i.e. Molecular Process, Cellular Component or Molecular F...

Descripción completa

Detalles Bibliográficos
Autores principales: Romero-Zaliz, R., del Val, C., Cobb, J. P., Zwir, I.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447763/
https://www.ncbi.nlm.nih.gov/pubmed/18544607
http://dx.doi.org/10.1093/nar/gkn323
_version_ 1782156993722580992
author Romero-Zaliz, R.
del Val, C.
Cobb, J. P.
Zwir, I.
author_facet Romero-Zaliz, R.
del Val, C.
Cobb, J. P.
Zwir, I.
author_sort Romero-Zaliz, R.
collection PubMed
description The Gene Ontology (GO) vocabulary has been extensively explored to analyze the functions of coexpressed genes. However, despite its extended use in Biology and Medical Sciences, there are still high levels of uncertainty about which ontology (i.e. Molecular Process, Cellular Component or Molecular Function) should be used, and at which level of specificity. Moreover, the GO database can contain incomplete information resulting from human annotations, or highly influenced by the available knowledge about a specific branch in an ontology. In spite of these drawbacks, there is a trend to ignore these problems and even use GO terms to conduct searches of gene expression profiles (i.e. expression + GO) instead of more cautious approaches that just consider them as an independent source of validation (i.e. expression versus GO). Consequently, propagating the uncertainty and producing biased analysis of the required gene grouping hypotheses. We proposed a web tool, Onto-CC, as an automatic method specially suited for independent explanation/validation of gene grouping hypotheses (e.g. coexpressed genes) based on GO clusters (i.e. expression versus GO). Onto-CC approach reduces the uncertainty of the queries by identifying optimal conceptual clusters that combine terms from different ontologies simultaneously, as well as terms defined at different levels of specificity in the GO hierarchy. To do so, we implemented the EMO-CC methodology to find clusters in structural databases [GO Directed acyclic Graph (DAG) tree], inspired on Conceptual Clustering algorithms. This approach allows the management of optimal cluster sets as potential parallel hypotheses, guided by multiobjective/multimodal optimization techniques. Therefore, we can generate alternative and, still, optimal explanations of queries that can provide new insights for a given problem. Onto-CC has been successfully used to test different medical and biological hypotheses including the explanation and prediction of gene expression profiles resulting from the host response to injuries in the inflammatory problem. Onto-CC provides two versions: Ready2GO, a precalculated EMO-CC for several genomes and an Advanced Onto-CC for custom annotation files (http://gps-tools2.wustl.edu/onto-cc/index.html).
format Text
id pubmed-2447763
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-24477632008-07-09 Onto-CC: a web server for identifying Gene Ontology conceptual clusters Romero-Zaliz, R. del Val, C. Cobb, J. P. Zwir, I. Nucleic Acids Res Articles The Gene Ontology (GO) vocabulary has been extensively explored to analyze the functions of coexpressed genes. However, despite its extended use in Biology and Medical Sciences, there are still high levels of uncertainty about which ontology (i.e. Molecular Process, Cellular Component or Molecular Function) should be used, and at which level of specificity. Moreover, the GO database can contain incomplete information resulting from human annotations, or highly influenced by the available knowledge about a specific branch in an ontology. In spite of these drawbacks, there is a trend to ignore these problems and even use GO terms to conduct searches of gene expression profiles (i.e. expression + GO) instead of more cautious approaches that just consider them as an independent source of validation (i.e. expression versus GO). Consequently, propagating the uncertainty and producing biased analysis of the required gene grouping hypotheses. We proposed a web tool, Onto-CC, as an automatic method specially suited for independent explanation/validation of gene grouping hypotheses (e.g. coexpressed genes) based on GO clusters (i.e. expression versus GO). Onto-CC approach reduces the uncertainty of the queries by identifying optimal conceptual clusters that combine terms from different ontologies simultaneously, as well as terms defined at different levels of specificity in the GO hierarchy. To do so, we implemented the EMO-CC methodology to find clusters in structural databases [GO Directed acyclic Graph (DAG) tree], inspired on Conceptual Clustering algorithms. This approach allows the management of optimal cluster sets as potential parallel hypotheses, guided by multiobjective/multimodal optimization techniques. Therefore, we can generate alternative and, still, optimal explanations of queries that can provide new insights for a given problem. Onto-CC has been successfully used to test different medical and biological hypotheses including the explanation and prediction of gene expression profiles resulting from the host response to injuries in the inflammatory problem. Onto-CC provides two versions: Ready2GO, a precalculated EMO-CC for several genomes and an Advanced Onto-CC for custom annotation files (http://gps-tools2.wustl.edu/onto-cc/index.html). Oxford University Press 2008-07-01 2008-06-10 /pmc/articles/PMC2447763/ /pubmed/18544607 http://dx.doi.org/10.1093/nar/gkn323 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Romero-Zaliz, R.
del Val, C.
Cobb, J. P.
Zwir, I.
Onto-CC: a web server for identifying Gene Ontology conceptual clusters
title Onto-CC: a web server for identifying Gene Ontology conceptual clusters
title_full Onto-CC: a web server for identifying Gene Ontology conceptual clusters
title_fullStr Onto-CC: a web server for identifying Gene Ontology conceptual clusters
title_full_unstemmed Onto-CC: a web server for identifying Gene Ontology conceptual clusters
title_short Onto-CC: a web server for identifying Gene Ontology conceptual clusters
title_sort onto-cc: a web server for identifying gene ontology conceptual clusters
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447763/
https://www.ncbi.nlm.nih.gov/pubmed/18544607
http://dx.doi.org/10.1093/nar/gkn323
work_keys_str_mv AT romerozalizr ontoccawebserverforidentifyinggeneontologyconceptualclusters
AT delvalc ontoccawebserverforidentifyinggeneontologyconceptualclusters
AT cobbjp ontoccawebserverforidentifyinggeneontologyconceptualclusters
AT zwiri ontoccawebserverforidentifyinggeneontologyconceptualclusters