Cargando…

GO-based Functional Dissimilarity of Gene Sets

BACKGROUND: The Gene Ontology (GO) provides a controlled vocabulary for describing the functions of genes and can be used to evaluate the functional coherence of gene sets. Many functional coherence measures consider each pair of gene functions in a set and produce an output based on all pairwise di...

Descripción completa

Detalles Bibliográficos
Autores principales: Díaz-Díaz, Norberto, Aguilar-Ruiz, Jesús S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3248071/
https://www.ncbi.nlm.nih.gov/pubmed/21884611
http://dx.doi.org/10.1186/1471-2105-12-360
_version_ 1782220194370813952
author Díaz-Díaz, Norberto
Aguilar-Ruiz, Jesús S
author_facet Díaz-Díaz, Norberto
Aguilar-Ruiz, Jesús S
author_sort Díaz-Díaz, Norberto
collection PubMed
description BACKGROUND: The Gene Ontology (GO) provides a controlled vocabulary for describing the functions of genes and can be used to evaluate the functional coherence of gene sets. Many functional coherence measures consider each pair of gene functions in a set and produce an output based on all pairwise distances. A single gene can encode multiple proteins that may differ in function. For each functionality, other proteins that exhibit the same activity may also participate. Therefore, an identification of the most common function for all of the genes involved in a biological process is important in evaluating the functional similarity of groups of genes and a quantification of functional coherence can helps to clarify the role of a group of genes working together. RESULTS: To implement this approach to functional assessment, we present GFD (GO-based Functional Dissimilarity), a novel dissimilarity measure for evaluating groups of genes based on the most relevant functions of the whole set. The measure assigns a numerical value to the gene set for each of the three GO sub-ontologies. CONCLUSIONS: Results show that GFD performs robustly when applied to gene set of known functionality (extracted from KEGG). It performs particularly well on randomly generated gene sets. An ROC analysis reveals that the performance of GFD in evaluating the functional dissimilarity of gene sets is very satisfactory. A comparative analysis against other functional measures, such as GS(2 )and those presented by Resnik and Wang, also demonstrates the robustness of GFD.
format Online
Article
Text
id pubmed-3248071
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32480712011-12-30 GO-based Functional Dissimilarity of Gene Sets Díaz-Díaz, Norberto Aguilar-Ruiz, Jesús S BMC Bioinformatics Methodology Article BACKGROUND: The Gene Ontology (GO) provides a controlled vocabulary for describing the functions of genes and can be used to evaluate the functional coherence of gene sets. Many functional coherence measures consider each pair of gene functions in a set and produce an output based on all pairwise distances. A single gene can encode multiple proteins that may differ in function. For each functionality, other proteins that exhibit the same activity may also participate. Therefore, an identification of the most common function for all of the genes involved in a biological process is important in evaluating the functional similarity of groups of genes and a quantification of functional coherence can helps to clarify the role of a group of genes working together. RESULTS: To implement this approach to functional assessment, we present GFD (GO-based Functional Dissimilarity), a novel dissimilarity measure for evaluating groups of genes based on the most relevant functions of the whole set. The measure assigns a numerical value to the gene set for each of the three GO sub-ontologies. CONCLUSIONS: Results show that GFD performs robustly when applied to gene set of known functionality (extracted from KEGG). It performs particularly well on randomly generated gene sets. An ROC analysis reveals that the performance of GFD in evaluating the functional dissimilarity of gene sets is very satisfactory. A comparative analysis against other functional measures, such as GS(2 )and those presented by Resnik and Wang, also demonstrates the robustness of GFD. BioMed Central 2011-09-01 /pmc/articles/PMC3248071/ /pubmed/21884611 http://dx.doi.org/10.1186/1471-2105-12-360 Text en Copyright ©2011 Díaz-Díaz and Aguilar-Ruiz; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Díaz-Díaz, Norberto
Aguilar-Ruiz, Jesús S
GO-based Functional Dissimilarity of Gene Sets
title GO-based Functional Dissimilarity of Gene Sets
title_full GO-based Functional Dissimilarity of Gene Sets
title_fullStr GO-based Functional Dissimilarity of Gene Sets
title_full_unstemmed GO-based Functional Dissimilarity of Gene Sets
title_short GO-based Functional Dissimilarity of Gene Sets
title_sort go-based functional dissimilarity of gene sets
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3248071/
https://www.ncbi.nlm.nih.gov/pubmed/21884611
http://dx.doi.org/10.1186/1471-2105-12-360
work_keys_str_mv AT diazdiaznorberto gobasedfunctionaldissimilarityofgenesets
AT aguilarruizjesuss gobasedfunctionaldissimilarityofgenesets