Cargando…

GRank: a middleware search engine for ranking genes by relevance to given genes

BACKGROUND: Biologists may need to know the set of genes that are semantically related to a given set of genes. For instance, a biologist may need to know the set of genes related to another set of genes known to be involved in a specific disease. Some works use the concept of gene clustering in ord...

Descripción completa

Detalles Bibliográficos
Autores principales: Taha, Kamal, Homouz, Dirar, Al Muhairi, Hassan, Al Mahmoud, Zaid
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3765412/
https://www.ncbi.nlm.nih.gov/pubmed/23957362
http://dx.doi.org/10.1186/1471-2105-14-251
_version_ 1782283303192100864
author Taha, Kamal
Homouz, Dirar
Al Muhairi, Hassan
Al Mahmoud, Zaid
author_facet Taha, Kamal
Homouz, Dirar
Al Muhairi, Hassan
Al Mahmoud, Zaid
author_sort Taha, Kamal
collection PubMed
description BACKGROUND: Biologists may need to know the set of genes that are semantically related to a given set of genes. For instance, a biologist may need to know the set of genes related to another set of genes known to be involved in a specific disease. Some works use the concept of gene clustering in order to identify semantically related genes. Others propose tools that return the set of genes that are semantically related to a given set of genes. Most of these gene similarity measures determine the semantic similarities among the genes based solely on the proximity to each other of the GO terms annotating the genes, while overlook the structural dependencies among these GO terms, which may lead to low recall and precision of results. RESULTS: We propose in this paper a search engine called GRank, which overcomes the limitations of the current gene similarity measures outlined above as follows. It employs the concept of existence dependency to determine the structural dependencies among the GO terms annotating a given set of gene. After determining the set of genes that are semantically related to input genes, GRank would use microarray experiment to rank these genes based on their degree of relativity to the input genes. We evaluated GRank experimentally and compared it with a comparable gene prediction tool called DynGO, which retrieves the genes and gene products that are relatives of input genes. Results showed marked improvement. CONCLUSIONS: The experimental results demonstrated that GRank overcomes the limitations of current gene similarity measures. We attribute this performance to GRank’s use of existence dependency concept for determining the semantic relationships among gene annotations. The recall and precision values for two benchmarking datasets showed that GRank outperforms DynGO tool, which does not employ the concept of existence dependency. The demo of GRank using 11000 KEGG yeast genes and a Gene Expression Omnibus (GEO) microarray file named “GSM34635.pad” is available at: http://ecesrvr.kustar.ac.ae:8080/ (click on the link labelled Gene Ontology 2).
format Online
Article
Text
id pubmed-3765412
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-37654122013-09-10 GRank: a middleware search engine for ranking genes by relevance to given genes Taha, Kamal Homouz, Dirar Al Muhairi, Hassan Al Mahmoud, Zaid BMC Bioinformatics Research Article BACKGROUND: Biologists may need to know the set of genes that are semantically related to a given set of genes. For instance, a biologist may need to know the set of genes related to another set of genes known to be involved in a specific disease. Some works use the concept of gene clustering in order to identify semantically related genes. Others propose tools that return the set of genes that are semantically related to a given set of genes. Most of these gene similarity measures determine the semantic similarities among the genes based solely on the proximity to each other of the GO terms annotating the genes, while overlook the structural dependencies among these GO terms, which may lead to low recall and precision of results. RESULTS: We propose in this paper a search engine called GRank, which overcomes the limitations of the current gene similarity measures outlined above as follows. It employs the concept of existence dependency to determine the structural dependencies among the GO terms annotating a given set of gene. After determining the set of genes that are semantically related to input genes, GRank would use microarray experiment to rank these genes based on their degree of relativity to the input genes. We evaluated GRank experimentally and compared it with a comparable gene prediction tool called DynGO, which retrieves the genes and gene products that are relatives of input genes. Results showed marked improvement. CONCLUSIONS: The experimental results demonstrated that GRank overcomes the limitations of current gene similarity measures. We attribute this performance to GRank’s use of existence dependency concept for determining the semantic relationships among gene annotations. The recall and precision values for two benchmarking datasets showed that GRank outperforms DynGO tool, which does not employ the concept of existence dependency. The demo of GRank using 11000 KEGG yeast genes and a Gene Expression Omnibus (GEO) microarray file named “GSM34635.pad” is available at: http://ecesrvr.kustar.ac.ae:8080/ (click on the link labelled Gene Ontology 2). BioMed Central 2013-08-19 /pmc/articles/PMC3765412/ /pubmed/23957362 http://dx.doi.org/10.1186/1471-2105-14-251 Text en Copyright © 2013 Taha et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Taha, Kamal
Homouz, Dirar
Al Muhairi, Hassan
Al Mahmoud, Zaid
GRank: a middleware search engine for ranking genes by relevance to given genes
title GRank: a middleware search engine for ranking genes by relevance to given genes
title_full GRank: a middleware search engine for ranking genes by relevance to given genes
title_fullStr GRank: a middleware search engine for ranking genes by relevance to given genes
title_full_unstemmed GRank: a middleware search engine for ranking genes by relevance to given genes
title_short GRank: a middleware search engine for ranking genes by relevance to given genes
title_sort grank: a middleware search engine for ranking genes by relevance to given genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3765412/
https://www.ncbi.nlm.nih.gov/pubmed/23957362
http://dx.doi.org/10.1186/1471-2105-14-251
work_keys_str_mv AT tahakamal grankamiddlewaresearchengineforrankinggenesbyrelevancetogivengenes
AT homouzdirar grankamiddlewaresearchengineforrankinggenesbyrelevancetogivengenes
AT almuhairihassan grankamiddlewaresearchengineforrankinggenesbyrelevancetogivengenes
AT almahmoudzaid grankamiddlewaresearchengineforrankinggenesbyrelevancetogivengenes