Onto-CC: a web server for identifying Gene Ontology conceptual clusters
The Gene Ontology (GO) vocabulary has been extensively explored to analyze the functions of coexpressed genes. However, despite its extensive use in biology and the medical sciences, there is still considerable uncertainty about which ontology (i.e. Biological Process, Cellular Component or Molecular F...
Main authors: | Romero-Zaliz, R.; del Val, C.; Cobb, J. P.; Zwir, I. |
---|---|
Format: | Text |
Language: | English |
Published: | Oxford University Press, 2008 |
Subjects: | Articles |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447763/ https://www.ncbi.nlm.nih.gov/pubmed/18544607 http://dx.doi.org/10.1093/nar/gkn323 |
_version_ | 1782156993722580992 |
---|---|
author | Romero-Zaliz, R.; del Val, C.; Cobb, J. P.; Zwir, I. |
author_facet | Romero-Zaliz, R.; del Val, C.; Cobb, J. P.; Zwir, I. |
author_sort | Romero-Zaliz, R. |
collection | PubMed |
description | The Gene Ontology (GO) vocabulary has been extensively explored to analyze the functions of coexpressed genes. However, despite its extensive use in biology and the medical sciences, there is still considerable uncertainty about which ontology (i.e. Biological Process, Cellular Component or Molecular Function) should be used, and at which level of specificity. Moreover, the GO database can contain information that is incomplete because it results from human annotation, or that is strongly influenced by the knowledge available about a specific branch of an ontology. In spite of these drawbacks, there is a trend to ignore these problems and even to use GO terms to conduct searches of gene expression profiles (i.e. expression + GO), instead of more cautious approaches that consider them only as an independent source of validation (i.e. expression versus GO). This propagates the uncertainty and produces biased analyses of the gene grouping hypotheses under study. We propose a web tool, Onto-CC, as an automatic method especially suited for independent explanation/validation of gene grouping hypotheses (e.g. coexpressed genes) based on GO clusters (i.e. expression versus GO). The Onto-CC approach reduces the uncertainty of the queries by identifying optimal conceptual clusters that combine terms from different ontologies simultaneously, as well as terms defined at different levels of specificity in the GO hierarchy. To do so, we implemented the EMO-CC methodology, inspired by conceptual clustering algorithms, to find clusters in structural databases [the GO directed acyclic graph (DAG)]. This approach allows optimal cluster sets to be managed as potential parallel hypotheses, guided by multiobjective/multimodal optimization techniques. Therefore, we can generate alternative, yet still optimal, explanations of queries that can provide new insights into a given problem. Onto-CC has been successfully used to test different medical and biological hypotheses, including the explanation and prediction of gene expression profiles resulting from the host response to injury in the inflammatory problem. Onto-CC provides two versions: Ready2GO, a precalculated EMO-CC for several genomes, and Advanced Onto-CC, for custom annotation files (http://gps-tools2.wustl.edu/onto-cc/index.html). |
format | Text |
id | pubmed-2447763 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-2447763 2008-07-09 Onto-CC: a web server for identifying Gene Ontology conceptual clusters Romero-Zaliz, R.; del Val, C.; Cobb, J. P.; Zwir, I. Nucleic Acids Res Articles The Gene Ontology (GO) vocabulary has been extensively explored to analyze the functions of coexpressed genes. However, despite its extensive use in biology and the medical sciences, there is still considerable uncertainty about which ontology (i.e. Biological Process, Cellular Component or Molecular Function) should be used, and at which level of specificity. Moreover, the GO database can contain information that is incomplete because it results from human annotation, or that is strongly influenced by the knowledge available about a specific branch of an ontology. In spite of these drawbacks, there is a trend to ignore these problems and even to use GO terms to conduct searches of gene expression profiles (i.e. expression + GO), instead of more cautious approaches that consider them only as an independent source of validation (i.e. expression versus GO). This propagates the uncertainty and produces biased analyses of the gene grouping hypotheses under study. We propose a web tool, Onto-CC, as an automatic method especially suited for independent explanation/validation of gene grouping hypotheses (e.g. coexpressed genes) based on GO clusters (i.e. expression versus GO). The Onto-CC approach reduces the uncertainty of the queries by identifying optimal conceptual clusters that combine terms from different ontologies simultaneously, as well as terms defined at different levels of specificity in the GO hierarchy. To do so, we implemented the EMO-CC methodology, inspired by conceptual clustering algorithms, to find clusters in structural databases [the GO directed acyclic graph (DAG)]. This approach allows optimal cluster sets to be managed as potential parallel hypotheses, guided by multiobjective/multimodal optimization techniques. Therefore, we can generate alternative, yet still optimal, explanations of queries that can provide new insights into a given problem. Onto-CC has been successfully used to test different medical and biological hypotheses, including the explanation and prediction of gene expression profiles resulting from the host response to injury in the inflammatory problem. Onto-CC provides two versions: Ready2GO, a precalculated EMO-CC for several genomes, and Advanced Onto-CC, for custom annotation files (http://gps-tools2.wustl.edu/onto-cc/index.html). Oxford University Press 2008-07-01 2008-06-10 /pmc/articles/PMC2447763/ /pubmed/18544607 http://dx.doi.org/10.1093/nar/gkn323 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Articles Romero-Zaliz, R.; del Val, C.; Cobb, J. P.; Zwir, I. Onto-CC: a web server for identifying Gene Ontology conceptual clusters |
title | Onto-CC: a web server for identifying Gene Ontology conceptual clusters |
title_full | Onto-CC: a web server for identifying Gene Ontology conceptual clusters |
title_fullStr | Onto-CC: a web server for identifying Gene Ontology conceptual clusters |
title_full_unstemmed | Onto-CC: a web server for identifying Gene Ontology conceptual clusters |
title_short | Onto-CC: a web server for identifying Gene Ontology conceptual clusters |
title_sort | onto-cc: a web server for identifying gene ontology conceptual clusters |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447763/ https://www.ncbi.nlm.nih.gov/pubmed/18544607 http://dx.doi.org/10.1093/nar/gkn323 |
work_keys_str_mv | AT romerozalizr ontoccawebserverforidentifyinggeneontologyconceptualclusters AT delvalc ontoccawebserverforidentifyinggeneontologyconceptualclusters AT cobbjp ontoccawebserverforidentifyinggeneontologyconceptualclusters AT zwiri ontoccawebserverforidentifyinggeneontologyconceptualclusters |