Cargando…
Gene analogue finder: a GRID solution for finding functionally analogous gene products
BACKGROUND: To date more than 2,1 million gene products from more than 100000 different species have been described specifying their function, the processes they are involved in and their cellular localization using a very well defined and structured vocabulary, the gene ontology (GO). Such vast, we...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2020485/ https://www.ncbi.nlm.nih.gov/pubmed/17767718 http://dx.doi.org/10.1186/1471-2105-8-329 |
_version_ | 1782136605401677824 |
---|---|
author | Tulipano, Angelica Donvito, Giacinto Licciulli, Flavio Maggi, Giorgio Gisel, Andreas |
author_facet | Tulipano, Angelica Donvito, Giacinto Licciulli, Flavio Maggi, Giorgio Gisel, Andreas |
author_sort | Tulipano, Angelica |
collection | PubMed |
description | BACKGROUND: To date more than 2,1 million gene products from more than 100000 different species have been described specifying their function, the processes they are involved in and their cellular localization using a very well defined and structured vocabulary, the gene ontology (GO). Such vast, well defined knowledge opens the possibility of compare gene products at the level of functionality, finding gene products which have a similar function or are involved in similar biological processes without relying on the conventional sequence similarity approach. Comparisons within such a large space of knowledge are highly data and computing intensive. For this reason this project was based upon the use of the computational GRID, a technology offering large computing and storage resources. RESULTS: We have developed a tool, GENe AnaloGue FINdEr (ENGINE) that parallelizes the search process and distributes the calculation and data over the computational GRID, splitting the process into many sub-processes and joining the calculation and the data on the same machine and therefore completing the whole search in about 3 days instead of occupying one single machine for more than 5 CPU years. The results of the functional comparison contain potential functional analogues for more than 79000 gene products from the most important species. 46% of the analyzed gene products are well enough described for such an analysis to individuate functional analogues, such as well-known members of the same gene family, or gene products with similar functions which would never have been associated by standard methods. CONCLUSION: ENGINE has produced a list of potential functionally analogous relations between gene products within and between species using, in place of the sequence, the gene description of the GO, thus demonstrating the potential of the GO. However, the current limiting factor is the quality of the associations of many gene products from non-model organisms that often have electronic associations, since experimental information is missing. With future improvements of the GO, this limit will be reduced. ENGINE will manifest its power when it is applied to the whole GODB of more than 2,1 million gene products from more than 100000 organisms. The data produced by this search is planed to be available as a supplement to the GO database as soon as we are able to provide regular updates. |
format | Text |
id | pubmed-2020485 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-20204852007-10-13 Gene analogue finder: a GRID solution for finding functionally analogous gene products Tulipano, Angelica Donvito, Giacinto Licciulli, Flavio Maggi, Giorgio Gisel, Andreas BMC Bioinformatics Research Article BACKGROUND: To date more than 2,1 million gene products from more than 100000 different species have been described specifying their function, the processes they are involved in and their cellular localization using a very well defined and structured vocabulary, the gene ontology (GO). Such vast, well defined knowledge opens the possibility of compare gene products at the level of functionality, finding gene products which have a similar function or are involved in similar biological processes without relying on the conventional sequence similarity approach. Comparisons within such a large space of knowledge are highly data and computing intensive. For this reason this project was based upon the use of the computational GRID, a technology offering large computing and storage resources. RESULTS: We have developed a tool, GENe AnaloGue FINdEr (ENGINE) that parallelizes the search process and distributes the calculation and data over the computational GRID, splitting the process into many sub-processes and joining the calculation and the data on the same machine and therefore completing the whole search in about 3 days instead of occupying one single machine for more than 5 CPU years. The results of the functional comparison contain potential functional analogues for more than 79000 gene products from the most important species. 46% of the analyzed gene products are well enough described for such an analysis to individuate functional analogues, such as well-known members of the same gene family, or gene products with similar functions which would never have been associated by standard methods. CONCLUSION: ENGINE has produced a list of potential functionally analogous relations between gene products within and between species using, in place of the sequence, the gene description of the GO, thus demonstrating the potential of the GO. However, the current limiting factor is the quality of the associations of many gene products from non-model organisms that often have electronic associations, since experimental information is missing. With future improvements of the GO, this limit will be reduced. ENGINE will manifest its power when it is applied to the whole GODB of more than 2,1 million gene products from more than 100000 organisms. The data produced by this search is planed to be available as a supplement to the GO database as soon as we are able to provide regular updates. BioMed Central 2007-09-03 /pmc/articles/PMC2020485/ /pubmed/17767718 http://dx.doi.org/10.1186/1471-2105-8-329 Text en Copyright © 2007 Tulipano et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Tulipano, Angelica Donvito, Giacinto Licciulli, Flavio Maggi, Giorgio Gisel, Andreas Gene analogue finder: a GRID solution for finding functionally analogous gene products |
title | Gene analogue finder: a GRID solution for finding functionally analogous gene products |
title_full | Gene analogue finder: a GRID solution for finding functionally analogous gene products |
title_fullStr | Gene analogue finder: a GRID solution for finding functionally analogous gene products |
title_full_unstemmed | Gene analogue finder: a GRID solution for finding functionally analogous gene products |
title_short | Gene analogue finder: a GRID solution for finding functionally analogous gene products |
title_sort | gene analogue finder: a grid solution for finding functionally analogous gene products |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2020485/ https://www.ncbi.nlm.nih.gov/pubmed/17767718 http://dx.doi.org/10.1186/1471-2105-8-329 |
work_keys_str_mv | AT tulipanoangelica geneanaloguefinderagridsolutionforfindingfunctionallyanalogousgeneproducts AT donvitogiacinto geneanaloguefinderagridsolutionforfindingfunctionallyanalogousgeneproducts AT licciulliflavio geneanaloguefinderagridsolutionforfindingfunctionallyanalogousgeneproducts AT maggigiorgio geneanaloguefinderagridsolutionforfindingfunctionallyanalogousgeneproducts AT giselandreas geneanaloguefinderagridsolutionforfindingfunctionallyanalogousgeneproducts |