Cargando…

Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins

Protein folding is a complex process that can lead to disease when it fails. Especially poorly understood are the very early stages of protein folding, which are likely defined by intrinsic local interactions between amino acids close to each other in the protein sequence. We here present EFoldMine,...

Descripción completa

Detalles Bibliográficos
Autores principales:	Raimondi, Daniele, Orlando, Gabriele, Pancsa, Rita, Khan, Taushif, Vranken, Wim F.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2017
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5562875/ https://www.ncbi.nlm.nih.gov/pubmed/28821744 http://dx.doi.org/10.1038/s41598-017-08366-3

_version_	1783258028382879744
author	Raimondi, Daniele Orlando, Gabriele Pancsa, Rita Khan, Taushif Vranken, Wim F.
author_facet	Raimondi, Daniele Orlando, Gabriele Pancsa, Rita Khan, Taushif Vranken, Wim F.
author_sort	Raimondi, Daniele
collection	PubMed
description	Protein folding is a complex process that can lead to disease when it fails. Especially poorly understood are the very early stages of protein folding, which are likely defined by intrinsic local interactions between amino acids close to each other in the protein sequence. We here present EFoldMine, a method that predicts, from the primary amino acid sequence of a protein, which amino acids are likely involved in early folding events. The method is based on early folding data from hydrogen deuterium exchange (HDX) data from NMR pulsed labelling experiments, and uses backbone and sidechain dynamics as well as secondary structure propensities as features. The EFoldMine predictions give insights into the folding process, as illustrated by a qualitative comparison with independent experimental observations. Furthermore, on a quantitative proteome scale, the predicted early folding residues tend to become the residues that interact the most in the folded structure, and they are often residues that display evolutionary covariation. The connection of the EFoldMine predictions with both folding pathway data and the folded protein structure suggests that the initial statistical behavior of the protein chain with respect to local structure formation has a lasting effect on its subsequent states.
format	Online Article Text
id	pubmed-5562875
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-55628752017-08-21 Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins Raimondi, Daniele Orlando, Gabriele Pancsa, Rita Khan, Taushif Vranken, Wim F. Sci Rep Article Protein folding is a complex process that can lead to disease when it fails. Especially poorly understood are the very early stages of protein folding, which are likely defined by intrinsic local interactions between amino acids close to each other in the protein sequence. We here present EFoldMine, a method that predicts, from the primary amino acid sequence of a protein, which amino acids are likely involved in early folding events. The method is based on early folding data from hydrogen deuterium exchange (HDX) data from NMR pulsed labelling experiments, and uses backbone and sidechain dynamics as well as secondary structure propensities as features. The EFoldMine predictions give insights into the folding process, as illustrated by a qualitative comparison with independent experimental observations. Furthermore, on a quantitative proteome scale, the predicted early folding residues tend to become the residues that interact the most in the folded structure, and they are often residues that display evolutionary covariation. The connection of the EFoldMine predictions with both folding pathway data and the folded protein structure suggests that the initial statistical behavior of the protein chain with respect to local structure formation has a lasting effect on its subsequent states. Nature Publishing Group UK 2017-08-18 /pmc/articles/PMC5562875/ /pubmed/28821744 http://dx.doi.org/10.1038/s41598-017-08366-3 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle	Article Raimondi, Daniele Orlando, Gabriele Pancsa, Rita Khan, Taushif Vranken, Wim F. Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins
title	Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins
title_full	Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins
title_fullStr	Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins
title_full_unstemmed	Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins
title_short	Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins
title_sort	exploring the sequence-based prediction of folding initiation sites in proteins
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5562875/ https://www.ncbi.nlm.nih.gov/pubmed/28821744 http://dx.doi.org/10.1038/s41598-017-08366-3
work_keys_str_mv	AT raimondidaniele exploringthesequencebasedpredictionoffoldinginitiationsitesinproteins AT orlandogabriele exploringthesequencebasedpredictionoffoldinginitiationsitesinproteins AT pancsarita exploringthesequencebasedpredictionoffoldinginitiationsitesinproteins AT khantaushif exploringthesequencebasedpredictionoffoldinginitiationsitesinproteins AT vrankenwimf exploringthesequencebasedpredictionoffoldinginitiationsitesinproteins

Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins

Ejemplares similares