Cargando…
GRank: a middleware search engine for ranking genes by relevance to given genes
BACKGROUND: Biologists may need to know the set of genes that are semantically related to a given set of genes. For instance, a biologist may need to know the set of genes related to another set of genes known to be involved in a specific disease. Some works use the concept of gene clustering in ord...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3765412/ https://www.ncbi.nlm.nih.gov/pubmed/23957362 http://dx.doi.org/10.1186/1471-2105-14-251 |
_version_ | 1782283303192100864 |
---|---|
author | Taha, Kamal Homouz, Dirar Al Muhairi, Hassan Al Mahmoud, Zaid |
author_facet | Taha, Kamal Homouz, Dirar Al Muhairi, Hassan Al Mahmoud, Zaid |
author_sort | Taha, Kamal |
collection | PubMed |
description | BACKGROUND: Biologists may need to know the set of genes that are semantically related to a given set of genes. For instance, a biologist may need to know the set of genes related to another set of genes known to be involved in a specific disease. Some works use the concept of gene clustering in order to identify semantically related genes. Others propose tools that return the set of genes that are semantically related to a given set of genes. Most of these gene similarity measures determine the semantic similarities among the genes based solely on the proximity to each other of the GO terms annotating the genes, while overlook the structural dependencies among these GO terms, which may lead to low recall and precision of results. RESULTS: We propose in this paper a search engine called GRank, which overcomes the limitations of the current gene similarity measures outlined above as follows. It employs the concept of existence dependency to determine the structural dependencies among the GO terms annotating a given set of gene. After determining the set of genes that are semantically related to input genes, GRank would use microarray experiment to rank these genes based on their degree of relativity to the input genes. We evaluated GRank experimentally and compared it with a comparable gene prediction tool called DynGO, which retrieves the genes and gene products that are relatives of input genes. Results showed marked improvement. CONCLUSIONS: The experimental results demonstrated that GRank overcomes the limitations of current gene similarity measures. We attribute this performance to GRank’s use of existence dependency concept for determining the semantic relationships among gene annotations. The recall and precision values for two benchmarking datasets showed that GRank outperforms DynGO tool, which does not employ the concept of existence dependency. The demo of GRank using 11000 KEGG yeast genes and a Gene Expression Omnibus (GEO) microarray file named “GSM34635.pad” is available at: http://ecesrvr.kustar.ac.ae:8080/ (click on the link labelled Gene Ontology 2). |
format | Online Article Text |
id | pubmed-3765412 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-37654122013-09-10 GRank: a middleware search engine for ranking genes by relevance to given genes Taha, Kamal Homouz, Dirar Al Muhairi, Hassan Al Mahmoud, Zaid BMC Bioinformatics Research Article BACKGROUND: Biologists may need to know the set of genes that are semantically related to a given set of genes. For instance, a biologist may need to know the set of genes related to another set of genes known to be involved in a specific disease. Some works use the concept of gene clustering in order to identify semantically related genes. Others propose tools that return the set of genes that are semantically related to a given set of genes. Most of these gene similarity measures determine the semantic similarities among the genes based solely on the proximity to each other of the GO terms annotating the genes, while overlook the structural dependencies among these GO terms, which may lead to low recall and precision of results. RESULTS: We propose in this paper a search engine called GRank, which overcomes the limitations of the current gene similarity measures outlined above as follows. It employs the concept of existence dependency to determine the structural dependencies among the GO terms annotating a given set of gene. After determining the set of genes that are semantically related to input genes, GRank would use microarray experiment to rank these genes based on their degree of relativity to the input genes. We evaluated GRank experimentally and compared it with a comparable gene prediction tool called DynGO, which retrieves the genes and gene products that are relatives of input genes. Results showed marked improvement. CONCLUSIONS: The experimental results demonstrated that GRank overcomes the limitations of current gene similarity measures. We attribute this performance to GRank’s use of existence dependency concept for determining the semantic relationships among gene annotations. The recall and precision values for two benchmarking datasets showed that GRank outperforms DynGO tool, which does not employ the concept of existence dependency. The demo of GRank using 11000 KEGG yeast genes and a Gene Expression Omnibus (GEO) microarray file named “GSM34635.pad” is available at: http://ecesrvr.kustar.ac.ae:8080/ (click on the link labelled Gene Ontology 2). BioMed Central 2013-08-19 /pmc/articles/PMC3765412/ /pubmed/23957362 http://dx.doi.org/10.1186/1471-2105-14-251 Text en Copyright © 2013 Taha et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Taha, Kamal Homouz, Dirar Al Muhairi, Hassan Al Mahmoud, Zaid GRank: a middleware search engine for ranking genes by relevance to given genes |
title | GRank: a middleware search engine for ranking genes by relevance to given genes |
title_full | GRank: a middleware search engine for ranking genes by relevance to given genes |
title_fullStr | GRank: a middleware search engine for ranking genes by relevance to given genes |
title_full_unstemmed | GRank: a middleware search engine for ranking genes by relevance to given genes |
title_short | GRank: a middleware search engine for ranking genes by relevance to given genes |
title_sort | grank: a middleware search engine for ranking genes by relevance to given genes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3765412/ https://www.ncbi.nlm.nih.gov/pubmed/23957362 http://dx.doi.org/10.1186/1471-2105-14-251 |
work_keys_str_mv | AT tahakamal grankamiddlewaresearchengineforrankinggenesbyrelevancetogivengenes AT homouzdirar grankamiddlewaresearchengineforrankinggenesbyrelevancetogivengenes AT almuhairihassan grankamiddlewaresearchengineforrankinggenesbyrelevancetogivengenes AT almahmoudzaid grankamiddlewaresearchengineforrankinggenesbyrelevancetogivengenes |