Cargando…

Evaluating the accuracy of protein design using native secondary sub-structures

BACKGROUND: According to structure-dependent function of proteins, two main challenging problems called Protein Structure Prediction (PSP) and Inverse Protein Folding (IPF) are investigated. In spite of IPF essential applications, it has not been investigated as much as PSP problem. In fact, the ult...

Descripción completa

Detalles Bibliográficos
Autores principales:	Movahedi, Marziyeh, Zare-Mirakabad, Fatemeh, Arab, Seyed Shahriar
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2016
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5011913/ https://www.ncbi.nlm.nih.gov/pubmed/27597167 http://dx.doi.org/10.1186/s12859-016-1199-y

_version_	1782451919623553024
author	Movahedi, Marziyeh Zare-Mirakabad, Fatemeh Arab, Seyed Shahriar
author_facet	Movahedi, Marziyeh Zare-Mirakabad, Fatemeh Arab, Seyed Shahriar
author_sort	Movahedi, Marziyeh
collection	PubMed
description	BACKGROUND: According to structure-dependent function of proteins, two main challenging problems called Protein Structure Prediction (PSP) and Inverse Protein Folding (IPF) are investigated. In spite of IPF essential applications, it has not been investigated as much as PSP problem. In fact, the ultimate goal of IPF problem or protein design is to create proteins with enhanced properties or even novel functions. One of the major computational challenges in protein design is its large sequence space, namely searching through all plausible sequences is impossible. Inasmuch as, protein secondary structure represents an appropriate primary scaffold of the protein conformation, undoubtedly studying the Protein Secondary Structure Inverse Folding (PSSIF) problem is a quantum leap forward in protein design, as it can reduce the search space. In this paper, a novel genetic algorithm which uses native secondary sub-structures is proposed to solve PSSIF problem. In essence, evolutionary information can lead the algorithm to design appropriate amino acid sequences respective to the target secondary structures. Furthermore, they can be folded to tertiary structures almost similar to their reference 3D structures. RESULTS: The proposed algorithm called GAPSSIF benefits from evolutionary information obtained by solved proteins in the PDB. Therefore, we construct a repository of protein secondary sub-structures to accelerate convergence of the algorithm. The secondary structure of designed sequences by GAPSSIF is comparable with those obtained by Evolver and EvoDesign. Although we do not explicitly consider tertiary structure features through the algorithm, the structural similarity of native and designed sequences declares acceptable values. CONCLUSIONS: Using the evolutionary information of native structures can significantly improve the quality of designed sequences. In fact, the combination of this information and effective features such as solvent accessibility and torsion angles leads IPF problem to an efficient solution. GAPSSIF can be downloaded at http://bioinformatics.aut.ac.ir/GAPSSIF/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1199-y) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-5011913
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-50119132016-09-15 Evaluating the accuracy of protein design using native secondary sub-structures Movahedi, Marziyeh Zare-Mirakabad, Fatemeh Arab, Seyed Shahriar BMC Bioinformatics Research Article BACKGROUND: According to structure-dependent function of proteins, two main challenging problems called Protein Structure Prediction (PSP) and Inverse Protein Folding (IPF) are investigated. In spite of IPF essential applications, it has not been investigated as much as PSP problem. In fact, the ultimate goal of IPF problem or protein design is to create proteins with enhanced properties or even novel functions. One of the major computational challenges in protein design is its large sequence space, namely searching through all plausible sequences is impossible. Inasmuch as, protein secondary structure represents an appropriate primary scaffold of the protein conformation, undoubtedly studying the Protein Secondary Structure Inverse Folding (PSSIF) problem is a quantum leap forward in protein design, as it can reduce the search space. In this paper, a novel genetic algorithm which uses native secondary sub-structures is proposed to solve PSSIF problem. In essence, evolutionary information can lead the algorithm to design appropriate amino acid sequences respective to the target secondary structures. Furthermore, they can be folded to tertiary structures almost similar to their reference 3D structures. RESULTS: The proposed algorithm called GAPSSIF benefits from evolutionary information obtained by solved proteins in the PDB. Therefore, we construct a repository of protein secondary sub-structures to accelerate convergence of the algorithm. The secondary structure of designed sequences by GAPSSIF is comparable with those obtained by Evolver and EvoDesign. Although we do not explicitly consider tertiary structure features through the algorithm, the structural similarity of native and designed sequences declares acceptable values. CONCLUSIONS: Using the evolutionary information of native structures can significantly improve the quality of designed sequences. In fact, the combination of this information and effective features such as solvent accessibility and torsion angles leads IPF problem to an efficient solution. GAPSSIF can be downloaded at http://bioinformatics.aut.ac.ir/GAPSSIF/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1199-y) contains supplementary material, which is available to authorized users. BioMed Central 2016-09-05 /pmc/articles/PMC5011913/ /pubmed/27597167 http://dx.doi.org/10.1186/s12859-016-1199-y Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Article Movahedi, Marziyeh Zare-Mirakabad, Fatemeh Arab, Seyed Shahriar Evaluating the accuracy of protein design using native secondary sub-structures
title	Evaluating the accuracy of protein design using native secondary sub-structures
title_full	Evaluating the accuracy of protein design using native secondary sub-structures
title_fullStr	Evaluating the accuracy of protein design using native secondary sub-structures
title_full_unstemmed	Evaluating the accuracy of protein design using native secondary sub-structures
title_short	Evaluating the accuracy of protein design using native secondary sub-structures
title_sort	evaluating the accuracy of protein design using native secondary sub-structures
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5011913/ https://www.ncbi.nlm.nih.gov/pubmed/27597167 http://dx.doi.org/10.1186/s12859-016-1199-y
work_keys_str_mv	AT movahedimarziyeh evaluatingtheaccuracyofproteindesignusingnativesecondarysubstructures AT zaremirakabadfatemeh evaluatingtheaccuracyofproteindesignusingnativesecondarysubstructures AT arabseyedshahriar evaluatingtheaccuracyofproteindesignusingnativesecondarysubstructures

Evaluating the accuracy of protein design using native secondary sub-structures

Ejemplares similares