Cargando…

Deciphering the Preference and Predicting the Viability of Circular Permutations in Proteins

Circular permutation (CP) refers to situations in which the termini of a protein are relocated to other positions in the structure. CP occurs naturally and has been artificially created to study protein function, stability and folding. Recently CP is increasingly applied to engineer enzyme structure...

Descripción completa

Detalles Bibliográficos
Autores principales: Lo, Wei-Cheng, Dai, Tian, Liu, Yen-Yi, Wang, Li-Fen, Hwang, Jenn-Kang, Lyu, Ping-Chiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3281007/
https://www.ncbi.nlm.nih.gov/pubmed/22359629
http://dx.doi.org/10.1371/journal.pone.0031791
_version_ 1782223899154448384
author Lo, Wei-Cheng
Dai, Tian
Liu, Yen-Yi
Wang, Li-Fen
Hwang, Jenn-Kang
Lyu, Ping-Chiang
author_facet Lo, Wei-Cheng
Dai, Tian
Liu, Yen-Yi
Wang, Li-Fen
Hwang, Jenn-Kang
Lyu, Ping-Chiang
author_sort Lo, Wei-Cheng
collection PubMed
description Circular permutation (CP) refers to situations in which the termini of a protein are relocated to other positions in the structure. CP occurs naturally and has been artificially created to study protein function, stability and folding. Recently CP is increasingly applied to engineer enzyme structure and function, and to create bifunctional fusion proteins unachievable by tandem fusion. CP is a complicated and expensive technique. An intrinsic difficulty in its application lies in the fact that not every position in a protein is amenable for creating a viable permutant. To examine the preferences of CP and develop CP viability prediction methods, we carried out comprehensive analyses of the sequence, structural, and dynamical properties of known CP sites using a variety of statistics and simulation methods, such as the bootstrap aggregating, permutation test and molecular dynamics simulations. CP particularly favors Gly, Pro, Asp and Asn. Positions preferred by CP lie within coils, loops, turns, and at residues that are exposed to solvent, weakly hydrogen-bonded, environmentally unpacked, or flexible. Disfavored positions include Cys, bulky hydrophobic residues, and residues located within helices or near the protein's core. These results fostered the development of an effective viable CP site prediction system, which combined four machine learning methods, e.g., artificial neural networks, the support vector machine, a random forest, and a hierarchical feature integration procedure developed in this work. As assessed by using the hydrofolate reductase dataset as the independent evaluation dataset, this prediction system achieved an AUC of 0.9. Large-scale predictions have been performed for nine thousand representative protein structures; several new potential applications of CP were thus identified. Many unreported preferences of CP are revealed in this study. The developed system is the best CP viability prediction method currently available. This work will facilitate the application of CP in research and biotechnology.
format Online
Article
Text
id pubmed-3281007
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32810072012-02-22 Deciphering the Preference and Predicting the Viability of Circular Permutations in Proteins Lo, Wei-Cheng Dai, Tian Liu, Yen-Yi Wang, Li-Fen Hwang, Jenn-Kang Lyu, Ping-Chiang PLoS One Research Article Circular permutation (CP) refers to situations in which the termini of a protein are relocated to other positions in the structure. CP occurs naturally and has been artificially created to study protein function, stability and folding. Recently CP is increasingly applied to engineer enzyme structure and function, and to create bifunctional fusion proteins unachievable by tandem fusion. CP is a complicated and expensive technique. An intrinsic difficulty in its application lies in the fact that not every position in a protein is amenable for creating a viable permutant. To examine the preferences of CP and develop CP viability prediction methods, we carried out comprehensive analyses of the sequence, structural, and dynamical properties of known CP sites using a variety of statistics and simulation methods, such as the bootstrap aggregating, permutation test and molecular dynamics simulations. CP particularly favors Gly, Pro, Asp and Asn. Positions preferred by CP lie within coils, loops, turns, and at residues that are exposed to solvent, weakly hydrogen-bonded, environmentally unpacked, or flexible. Disfavored positions include Cys, bulky hydrophobic residues, and residues located within helices or near the protein's core. These results fostered the development of an effective viable CP site prediction system, which combined four machine learning methods, e.g., artificial neural networks, the support vector machine, a random forest, and a hierarchical feature integration procedure developed in this work. As assessed by using the hydrofolate reductase dataset as the independent evaluation dataset, this prediction system achieved an AUC of 0.9. Large-scale predictions have been performed for nine thousand representative protein structures; several new potential applications of CP were thus identified. Many unreported preferences of CP are revealed in this study. The developed system is the best CP viability prediction method currently available. This work will facilitate the application of CP in research and biotechnology. Public Library of Science 2012-02-16 /pmc/articles/PMC3281007/ /pubmed/22359629 http://dx.doi.org/10.1371/journal.pone.0031791 Text en Lo et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Lo, Wei-Cheng
Dai, Tian
Liu, Yen-Yi
Wang, Li-Fen
Hwang, Jenn-Kang
Lyu, Ping-Chiang
Deciphering the Preference and Predicting the Viability of Circular Permutations in Proteins
title Deciphering the Preference and Predicting the Viability of Circular Permutations in Proteins
title_full Deciphering the Preference and Predicting the Viability of Circular Permutations in Proteins
title_fullStr Deciphering the Preference and Predicting the Viability of Circular Permutations in Proteins
title_full_unstemmed Deciphering the Preference and Predicting the Viability of Circular Permutations in Proteins
title_short Deciphering the Preference and Predicting the Viability of Circular Permutations in Proteins
title_sort deciphering the preference and predicting the viability of circular permutations in proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3281007/
https://www.ncbi.nlm.nih.gov/pubmed/22359629
http://dx.doi.org/10.1371/journal.pone.0031791
work_keys_str_mv AT loweicheng decipheringthepreferenceandpredictingtheviabilityofcircularpermutationsinproteins
AT daitian decipheringthepreferenceandpredictingtheviabilityofcircularpermutationsinproteins
AT liuyenyi decipheringthepreferenceandpredictingtheviabilityofcircularpermutationsinproteins
AT wanglifen decipheringthepreferenceandpredictingtheviabilityofcircularpermutationsinproteins
AT hwangjennkang decipheringthepreferenceandpredictingtheviabilityofcircularpermutationsinproteins
AT lyupingchiang decipheringthepreferenceandpredictingtheviabilityofcircularpermutationsinproteins