
GRank: a middleware search engine for ranking genes by relevance to given genes

BACKGROUND: Biologists may need to know the set of genes that are semantically related to a given set of genes. For instance, a biologist may need to know the set of genes related to another set of genes known to be involved in a specific disease. Some works use the concept of gene clustering in ord...


Bibliographic Details
Main Authors:	Taha, Kamal, Homouz, Dirar, Al Muhairi, Hassan, Al Mahmoud, Zaid
Format:	Online Article Text
Language:	English
Published:	BioMed Central 2013
Subjects:	Research Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3765412/ https://www.ncbi.nlm.nih.gov/pubmed/23957362 http://dx.doi.org/10.1186/1471-2105-14-251

_version_	1782283303192100864
author	Taha, Kamal Homouz, Dirar Al Muhairi, Hassan Al Mahmoud, Zaid
author_facet	Taha, Kamal Homouz, Dirar Al Muhairi, Hassan Al Mahmoud, Zaid
author_sort	Taha, Kamal
collection	PubMed
description	BACKGROUND: Biologists may need to know the set of genes that are semantically related to a given set of genes. For instance, a biologist may need to know the set of genes related to another set of genes known to be involved in a specific disease. Some works use the concept of gene clustering in order to identify semantically related genes. Others propose tools that return the set of genes that are semantically related to a given set of genes. Most of these gene similarity measures determine the semantic similarities among the genes based solely on the proximity to each other of the GO terms annotating the genes, while overlooking the structural dependencies among these GO terms, which may lead to low recall and precision of results. RESULTS: We propose in this paper a search engine called GRank, which overcomes the limitations of the current gene similarity measures outlined above as follows. It employs the concept of existence dependency to determine the structural dependencies among the GO terms annotating a given set of genes. After determining the set of genes that are semantically related to the input genes, GRank uses microarray experiment data to rank these genes based on their degree of relatedness to the input genes. We evaluated GRank experimentally and compared it with a comparable gene prediction tool called DynGO, which retrieves the genes and gene products that are relatives of input genes. Results showed marked improvement. CONCLUSIONS: The experimental results demonstrated that GRank overcomes the limitations of current gene similarity measures. We attribute this performance to GRank’s use of the existence dependency concept for determining the semantic relationships among gene annotations. The recall and precision values for two benchmarking datasets showed that GRank outperforms the DynGO tool, which does not employ the concept of existence dependency. The demo of GRank using 11000 KEGG yeast genes and a Gene Expression Omnibus (GEO) microarray file named “GSM34635.pad” is available at: http://ecesrvr.kustar.ac.ae:8080/ (click on the link labelled Gene Ontology 2).
format	Online Article Text
id	pubmed-3765412
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-3765412 2013-09-10 GRank: a middleware search engine for ranking genes by relevance to given genes Taha, Kamal Homouz, Dirar Al Muhairi, Hassan Al Mahmoud, Zaid BMC Bioinformatics Research Article BioMed Central 2013-08-19 /pmc/articles/PMC3765412/ /pubmed/23957362 http://dx.doi.org/10.1186/1471-2105-14-251 Text en Copyright © 2013 Taha et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Taha, Kamal Homouz, Dirar Al Muhairi, Hassan Al Mahmoud, Zaid GRank: a middleware search engine for ranking genes by relevance to given genes
title	GRank: a middleware search engine for ranking genes by relevance to given genes
title_full	GRank: a middleware search engine for ranking genes by relevance to given genes
title_fullStr	GRank: a middleware search engine for ranking genes by relevance to given genes
title_full_unstemmed	GRank: a middleware search engine for ranking genes by relevance to given genes
title_short	GRank: a middleware search engine for ranking genes by relevance to given genes
title_sort	grank: a middleware search engine for ranking genes by relevance to given genes
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3765412/ https://www.ncbi.nlm.nih.gov/pubmed/23957362 http://dx.doi.org/10.1186/1471-2105-14-251
work_keys_str_mv	AT tahakamal grankamiddlewaresearchengineforrankinggenesbyrelevancetogivengenes AT homouzdirar grankamiddlewaresearchengineforrankinggenesbyrelevancetogivengenes AT almuhairihassan grankamiddlewaresearchengineforrankinggenesbyrelevancetogivengenes AT almahmoudzaid grankamiddlewaresearchengineforrankinggenesbyrelevancetogivengenes
