Cargando…

High performance workflow implementation for protein surface characterization using grid technology

BACKGROUND: This study concerns the development of a high performance workflow that, using grid technology, correlates different kinds of Bioinformatics data, starting from the base pairs of the nucleotide sequence to the exposed residues of the protein surface. The implementation of this workflow i...

Descripción completa

Detalles Bibliográficos
Autores principales: Merelli, Ivan, Morra, Giulia, D'Agostino, Daniele, Clematis, Andrea, Milanesi, Luciano
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1866389/
https://www.ncbi.nlm.nih.gov/pubmed/16351745
http://dx.doi.org/10.1186/1471-2105-6-S4-S19
_version_ 1782133265022320640
author Merelli, Ivan
Morra, Giulia
D'Agostino, Daniele
Clematis, Andrea
Milanesi, Luciano
author_facet Merelli, Ivan
Morra, Giulia
D'Agostino, Daniele
Clematis, Andrea
Milanesi, Luciano
author_sort Merelli, Ivan
collection PubMed
description BACKGROUND: This study concerns the development of a high performance workflow that, using grid technology, correlates different kinds of Bioinformatics data, starting from the base pairs of the nucleotide sequence to the exposed residues of the protein surface. The implementation of this workflow is based on the Italian Grid.it project infrastructure, that is a network of several computational resources and storage facilities distributed at different grid sites. METHODS: Workflows are very common in Bioinformatics because they allow to process large quantities of data by delegating the management of resources to the information streaming. Grid technology optimizes the computational load during the different workflow steps, dividing the more expensive tasks into a set of small jobs. RESULTS: Grid technology allows efficient database management, a crucial problem for obtaining good results in Bioinformatics applications. The proposed workflow is implemented to integrate huge amounts of data and the results themselves must be stored into a relational database, which results as the added value to the global knowledge. CONCLUSION: A web interface has been developed to make this technology accessible to grid users. Once the workflow has started, by means of the simplified interface, it is possible to follow all the different steps throughout the data processing. Eventually, when the workflow has been terminated, the different features of the protein, like the amino acids exposed on the protein surface, can be compared with the data present in the output database.
format Text
id pubmed-1866389
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-18663892007-05-11 High performance workflow implementation for protein surface characterization using grid technology Merelli, Ivan Morra, Giulia D'Agostino, Daniele Clematis, Andrea Milanesi, Luciano BMC Bioinformatics Research Article BACKGROUND: This study concerns the development of a high performance workflow that, using grid technology, correlates different kinds of Bioinformatics data, starting from the base pairs of the nucleotide sequence to the exposed residues of the protein surface. The implementation of this workflow is based on the Italian Grid.it project infrastructure, that is a network of several computational resources and storage facilities distributed at different grid sites. METHODS: Workflows are very common in Bioinformatics because they allow to process large quantities of data by delegating the management of resources to the information streaming. Grid technology optimizes the computational load during the different workflow steps, dividing the more expensive tasks into a set of small jobs. RESULTS: Grid technology allows efficient database management, a crucial problem for obtaining good results in Bioinformatics applications. The proposed workflow is implemented to integrate huge amounts of data and the results themselves must be stored into a relational database, which results as the added value to the global knowledge. CONCLUSION: A web interface has been developed to make this technology accessible to grid users. Once the workflow has started, by means of the simplified interface, it is possible to follow all the different steps throughout the data processing. Eventually, when the workflow has been terminated, the different features of the protein, like the amino acids exposed on the protein surface, can be compared with the data present in the output database. BioMed Central 2005-12-01 /pmc/articles/PMC1866389/ /pubmed/16351745 http://dx.doi.org/10.1186/1471-2105-6-S4-S19 Text en Copyright © 2005 Merelli et al; licensee BioMed Central Ltd http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Merelli, Ivan
Morra, Giulia
D'Agostino, Daniele
Clematis, Andrea
Milanesi, Luciano
High performance workflow implementation for protein surface characterization using grid technology
title High performance workflow implementation for protein surface characterization using grid technology
title_full High performance workflow implementation for protein surface characterization using grid technology
title_fullStr High performance workflow implementation for protein surface characterization using grid technology
title_full_unstemmed High performance workflow implementation for protein surface characterization using grid technology
title_short High performance workflow implementation for protein surface characterization using grid technology
title_sort high performance workflow implementation for protein surface characterization using grid technology
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1866389/
https://www.ncbi.nlm.nih.gov/pubmed/16351745
http://dx.doi.org/10.1186/1471-2105-6-S4-S19
work_keys_str_mv AT merelliivan highperformanceworkflowimplementationforproteinsurfacecharacterizationusinggridtechnology
AT morragiulia highperformanceworkflowimplementationforproteinsurfacecharacterizationusinggridtechnology
AT dagostinodaniele highperformanceworkflowimplementationforproteinsurfacecharacterizationusinggridtechnology
AT clematisandrea highperformanceworkflowimplementationforproteinsurfacecharacterizationusinggridtechnology
AT milanesiluciano highperformanceworkflowimplementationforproteinsurfacecharacterizationusinggridtechnology