Cargando…

Gene analogue finder: a GRID solution for finding functionally analogous gene products

BACKGROUND: To date more than 2,1 million gene products from more than 100000 different species have been described specifying their function, the processes they are involved in and their cellular localization using a very well defined and structured vocabulary, the gene ontology (GO). Such vast, we...

Descripción completa

Detalles Bibliográficos
Autores principales: Tulipano, Angelica, Donvito, Giacinto, Licciulli, Flavio, Maggi, Giorgio, Gisel, Andreas
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2020485/
https://www.ncbi.nlm.nih.gov/pubmed/17767718
http://dx.doi.org/10.1186/1471-2105-8-329
_version_ 1782136605401677824
author Tulipano, Angelica
Donvito, Giacinto
Licciulli, Flavio
Maggi, Giorgio
Gisel, Andreas
author_facet Tulipano, Angelica
Donvito, Giacinto
Licciulli, Flavio
Maggi, Giorgio
Gisel, Andreas
author_sort Tulipano, Angelica
collection PubMed
description BACKGROUND: To date more than 2,1 million gene products from more than 100000 different species have been described specifying their function, the processes they are involved in and their cellular localization using a very well defined and structured vocabulary, the gene ontology (GO). Such vast, well defined knowledge opens the possibility of compare gene products at the level of functionality, finding gene products which have a similar function or are involved in similar biological processes without relying on the conventional sequence similarity approach. Comparisons within such a large space of knowledge are highly data and computing intensive. For this reason this project was based upon the use of the computational GRID, a technology offering large computing and storage resources. RESULTS: We have developed a tool, GENe AnaloGue FINdEr (ENGINE) that parallelizes the search process and distributes the calculation and data over the computational GRID, splitting the process into many sub-processes and joining the calculation and the data on the same machine and therefore completing the whole search in about 3 days instead of occupying one single machine for more than 5 CPU years. The results of the functional comparison contain potential functional analogues for more than 79000 gene products from the most important species. 46% of the analyzed gene products are well enough described for such an analysis to individuate functional analogues, such as well-known members of the same gene family, or gene products with similar functions which would never have been associated by standard methods. CONCLUSION: ENGINE has produced a list of potential functionally analogous relations between gene products within and between species using, in place of the sequence, the gene description of the GO, thus demonstrating the potential of the GO. However, the current limiting factor is the quality of the associations of many gene products from non-model organisms that often have electronic associations, since experimental information is missing. With future improvements of the GO, this limit will be reduced. ENGINE will manifest its power when it is applied to the whole GODB of more than 2,1 million gene products from more than 100000 organisms. The data produced by this search is planed to be available as a supplement to the GO database as soon as we are able to provide regular updates.
format Text
id pubmed-2020485
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-20204852007-10-13 Gene analogue finder: a GRID solution for finding functionally analogous gene products Tulipano, Angelica Donvito, Giacinto Licciulli, Flavio Maggi, Giorgio Gisel, Andreas BMC Bioinformatics Research Article BACKGROUND: To date more than 2,1 million gene products from more than 100000 different species have been described specifying their function, the processes they are involved in and their cellular localization using a very well defined and structured vocabulary, the gene ontology (GO). Such vast, well defined knowledge opens the possibility of compare gene products at the level of functionality, finding gene products which have a similar function or are involved in similar biological processes without relying on the conventional sequence similarity approach. Comparisons within such a large space of knowledge are highly data and computing intensive. For this reason this project was based upon the use of the computational GRID, a technology offering large computing and storage resources. RESULTS: We have developed a tool, GENe AnaloGue FINdEr (ENGINE) that parallelizes the search process and distributes the calculation and data over the computational GRID, splitting the process into many sub-processes and joining the calculation and the data on the same machine and therefore completing the whole search in about 3 days instead of occupying one single machine for more than 5 CPU years. The results of the functional comparison contain potential functional analogues for more than 79000 gene products from the most important species. 46% of the analyzed gene products are well enough described for such an analysis to individuate functional analogues, such as well-known members of the same gene family, or gene products with similar functions which would never have been associated by standard methods. CONCLUSION: ENGINE has produced a list of potential functionally analogous relations between gene products within and between species using, in place of the sequence, the gene description of the GO, thus demonstrating the potential of the GO. However, the current limiting factor is the quality of the associations of many gene products from non-model organisms that often have electronic associations, since experimental information is missing. With future improvements of the GO, this limit will be reduced. ENGINE will manifest its power when it is applied to the whole GODB of more than 2,1 million gene products from more than 100000 organisms. The data produced by this search is planed to be available as a supplement to the GO database as soon as we are able to provide regular updates. BioMed Central 2007-09-03 /pmc/articles/PMC2020485/ /pubmed/17767718 http://dx.doi.org/10.1186/1471-2105-8-329 Text en Copyright © 2007 Tulipano et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Tulipano, Angelica
Donvito, Giacinto
Licciulli, Flavio
Maggi, Giorgio
Gisel, Andreas
Gene analogue finder: a GRID solution for finding functionally analogous gene products
title Gene analogue finder: a GRID solution for finding functionally analogous gene products
title_full Gene analogue finder: a GRID solution for finding functionally analogous gene products
title_fullStr Gene analogue finder: a GRID solution for finding functionally analogous gene products
title_full_unstemmed Gene analogue finder: a GRID solution for finding functionally analogous gene products
title_short Gene analogue finder: a GRID solution for finding functionally analogous gene products
title_sort gene analogue finder: a grid solution for finding functionally analogous gene products
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2020485/
https://www.ncbi.nlm.nih.gov/pubmed/17767718
http://dx.doi.org/10.1186/1471-2105-8-329
work_keys_str_mv AT tulipanoangelica geneanaloguefinderagridsolutionforfindingfunctionallyanalogousgeneproducts
AT donvitogiacinto geneanaloguefinderagridsolutionforfindingfunctionallyanalogousgeneproducts
AT licciulliflavio geneanaloguefinderagridsolutionforfindingfunctionallyanalogousgeneproducts
AT maggigiorgio geneanaloguefinderagridsolutionforfindingfunctionallyanalogousgeneproducts
AT giselandreas geneanaloguefinderagridsolutionforfindingfunctionallyanalogousgeneproducts