Cargando…

Folding by Numbers: Primary Sequence Statistics and Their Use in Studying Protein Folding

The exponential growth over the past several decades in the quantity of both primary sequence data available and the number of protein structures determined has provided a wealth of information describing the relationship between protein primary sequence and tertiary structure. This growing reposito...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wathen, Brent, Jia, Zongchao
Formato:	Texto
Lenguaje:	English
Publicado:	Molecular Diversity Preservation International (MDPI) 2009
Materias:	Review
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2680634/ https://www.ncbi.nlm.nih.gov/pubmed/19468326 http://dx.doi.org/10.3390/ijms10041567

_version_	1782166962287149056
author	Wathen, Brent Jia, Zongchao
author_facet	Wathen, Brent Jia, Zongchao
author_sort	Wathen, Brent
collection	PubMed
description	The exponential growth over the past several decades in the quantity of both primary sequence data available and the number of protein structures determined has provided a wealth of information describing the relationship between protein primary sequence and tertiary structure. This growing repository of data has served as a prime source for statistical analysis, where underlying relationships between patterns of amino acids and protein structure can be uncovered. Here, we survey the main statistical approaches that have been used for identifying patterns within protein sequences, and discuss sequence pattern research as it relates to both secondary and tertiary protein structure. Limitations to statistical analyses are discussed, and a context for their role within the field of protein folding is given. We conclude by describing a novel statistical study of residue patterning in β-strands, which finds that hydrophobic (i,i+2) pairing in β-strands occurs more often than expected at locations near strand termini. Interpretations involving β-sheet nucleation and growth are discussed.
format	Text
id	pubmed-2680634
institution	National Center for Biotechnology Information
language	English
publishDate	2009
publisher	Molecular Diversity Preservation International (MDPI)
record_format	MEDLINE/PubMed
spelling	pubmed-26806342009-05-22 Folding by Numbers: Primary Sequence Statistics and Their Use in Studying Protein Folding Wathen, Brent Jia, Zongchao Int J Mol Sci Review The exponential growth over the past several decades in the quantity of both primary sequence data available and the number of protein structures determined has provided a wealth of information describing the relationship between protein primary sequence and tertiary structure. This growing repository of data has served as a prime source for statistical analysis, where underlying relationships between patterns of amino acids and protein structure can be uncovered. Here, we survey the main statistical approaches that have been used for identifying patterns within protein sequences, and discuss sequence pattern research as it relates to both secondary and tertiary protein structure. Limitations to statistical analyses are discussed, and a context for their role within the field of protein folding is given. We conclude by describing a novel statistical study of residue patterning in β-strands, which finds that hydrophobic (i,i+2) pairing in β-strands occurs more often than expected at locations near strand termini. Interpretations involving β-sheet nucleation and growth are discussed. Molecular Diversity Preservation International (MDPI) 2009-04-08 /pmc/articles/PMC2680634/ /pubmed/19468326 http://dx.doi.org/10.3390/ijms10041567 Text en © 2009 by the authors; licensee Molecular Diversity Preservation International, Basel, Switzerland. http://creativecommons.org/licenses/by/3.0 This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle	Review Wathen, Brent Jia, Zongchao Folding by Numbers: Primary Sequence Statistics and Their Use in Studying Protein Folding
title	Folding by Numbers: Primary Sequence Statistics and Their Use in Studying Protein Folding
title_full	Folding by Numbers: Primary Sequence Statistics and Their Use in Studying Protein Folding
title_fullStr	Folding by Numbers: Primary Sequence Statistics and Their Use in Studying Protein Folding
title_full_unstemmed	Folding by Numbers: Primary Sequence Statistics and Their Use in Studying Protein Folding
title_short	Folding by Numbers: Primary Sequence Statistics and Their Use in Studying Protein Folding
title_sort	folding by numbers: primary sequence statistics and their use in studying protein folding
topic	Review
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2680634/ https://www.ncbi.nlm.nih.gov/pubmed/19468326 http://dx.doi.org/10.3390/ijms10041567
work_keys_str_mv	AT wathenbrent foldingbynumbersprimarysequencestatisticsandtheiruseinstudyingproteinfolding AT jiazongchao foldingbynumbersprimarysequencestatisticsandtheiruseinstudyingproteinfolding

Folding by Numbers: Primary Sequence Statistics and Their Use in Studying Protein Folding

Ejemplares similares