Cargando…

Effect of conformation sampling strategies in genetic algorithm for multiple protein docking

BACKGROUND: Macromolecular protein complexes play important roles in a cell and their tertiary structure can help understand key biological processes of their functions. Multiple protein docking is a valuable computational tool for providing structure information of multimeric protein complexes. In...

Descripción completa

Detalles Bibliográficos
Autores principales:	Esquivel-Rodríguez, Juan, Kihara, Daisuke
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2012
Materias:	Proceedings
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3504801/ https://www.ncbi.nlm.nih.gov/pubmed/23173833 http://dx.doi.org/10.1186/1753-6561-6-S7-S4

_version_	1782250675395100672
author	Esquivel-Rodríguez, Juan Kihara, Daisuke
author_facet	Esquivel-Rodríguez, Juan Kihara, Daisuke
author_sort	Esquivel-Rodríguez, Juan
collection	PubMed
description	BACKGROUND: Macromolecular protein complexes play important roles in a cell and their tertiary structure can help understand key biological processes of their functions. Multiple protein docking is a valuable computational tool for providing structure information of multimeric protein complexes. In a previous study we developed and implemented an algorithm for this purpose, named Multi-LZerD. This method represents a conformation of a multimeric protein complex as a graph, where nodes denote subunits and each edge connecting nodes denotes a pairwise docking conformation of the two subunits. Multi-LZerD employs a genetic algorithm to sample different topologies of the graph and pairwise transformations between subunits, seeking for the conformation of the optimal (lowest) energy. In this study we explore different configurations of the genetic algorithm, namely, the population size, whether to include a crossover operation, as well as the threshold for structural clustering, to find the optimal experimental setup. METHODS: Multi-LZerD was executed to predict the structures of three multimeric protein complexes, using different population sizes, clustering thresholds, and configurations of mutation and crossover. We analyzed the impact of varying these parameters on the computational time and the prediction accuracy. RESULTS AND CONCLUSIONS: Given that computational resources is a key for handling complexes with a large number of subunits and also for computing a large number of protein complexes in a genome-scale study, finding a proper setting for sampling the conformation space is of the utmost importance. Our results show that an excessive sampling of the conformational space by increasing the population size or by introducing the crossover operation is not necessary for improving accuracy for predicting structures of small complexes. The clustering is effective in reducing redundant pairwise predictions, which leads to successful identification of near-native conformations.
format	Online Article Text
id	pubmed-3504801
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-35048012012-11-29 Effect of conformation sampling strategies in genetic algorithm for multiple protein docking Esquivel-Rodríguez, Juan Kihara, Daisuke BMC Proc Proceedings BACKGROUND: Macromolecular protein complexes play important roles in a cell and their tertiary structure can help understand key biological processes of their functions. Multiple protein docking is a valuable computational tool for providing structure information of multimeric protein complexes. In a previous study we developed and implemented an algorithm for this purpose, named Multi-LZerD. This method represents a conformation of a multimeric protein complex as a graph, where nodes denote subunits and each edge connecting nodes denotes a pairwise docking conformation of the two subunits. Multi-LZerD employs a genetic algorithm to sample different topologies of the graph and pairwise transformations between subunits, seeking for the conformation of the optimal (lowest) energy. In this study we explore different configurations of the genetic algorithm, namely, the population size, whether to include a crossover operation, as well as the threshold for structural clustering, to find the optimal experimental setup. METHODS: Multi-LZerD was executed to predict the structures of three multimeric protein complexes, using different population sizes, clustering thresholds, and configurations of mutation and crossover. We analyzed the impact of varying these parameters on the computational time and the prediction accuracy. RESULTS AND CONCLUSIONS: Given that computational resources is a key for handling complexes with a large number of subunits and also for computing a large number of protein complexes in a genome-scale study, finding a proper setting for sampling the conformation space is of the utmost importance. Our results show that an excessive sampling of the conformational space by increasing the population size or by introducing the crossover operation is not necessary for improving accuracy for predicting structures of small complexes. The clustering is effective in reducing redundant pairwise predictions, which leads to successful identification of near-native conformations. BioMed Central 2012-11-13 /pmc/articles/PMC3504801/ /pubmed/23173833 http://dx.doi.org/10.1186/1753-6561-6-S7-S4 Text en Copyright ©2012 Esquivel-Rodríguez and Kihara; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Proceedings Esquivel-Rodríguez, Juan Kihara, Daisuke Effect of conformation sampling strategies in genetic algorithm for multiple protein docking
title	Effect of conformation sampling strategies in genetic algorithm for multiple protein docking
title_full	Effect of conformation sampling strategies in genetic algorithm for multiple protein docking
title_fullStr	Effect of conformation sampling strategies in genetic algorithm for multiple protein docking
title_full_unstemmed	Effect of conformation sampling strategies in genetic algorithm for multiple protein docking
title_short	Effect of conformation sampling strategies in genetic algorithm for multiple protein docking
title_sort	effect of conformation sampling strategies in genetic algorithm for multiple protein docking
topic	Proceedings
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3504801/ https://www.ncbi.nlm.nih.gov/pubmed/23173833 http://dx.doi.org/10.1186/1753-6561-6-S7-S4
work_keys_str_mv	AT esquivelrodriguezjuan effectofconformationsamplingstrategiesingeneticalgorithmformultipleproteindocking AT kiharadaisuke effectofconformationsamplingstrategiesingeneticalgorithmformultipleproteindocking

Effect of conformation sampling strategies in genetic algorithm for multiple protein docking

Ejemplares similares