Cargando…

SeqRate: sequence-based protein folding type classification and rates prediction

BACKGROUND: Protein folding rate is an important property of a protein. Predicting protein folding rate is useful for understanding protein folding process and guiding protein design. Most previous methods of predicting protein folding rate require the tertiary structure of a protein as an input. An...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Guan Ning, Wang, Zheng, Xu, Dong, Cheng, Jianlin
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2863059/
https://www.ncbi.nlm.nih.gov/pubmed/20438647
http://dx.doi.org/10.1186/1471-2105-11-S3-S1
_version_ 1782180741157748736
author Lin, Guan Ning
Wang, Zheng
Xu, Dong
Cheng, Jianlin
author_facet Lin, Guan Ning
Wang, Zheng
Xu, Dong
Cheng, Jianlin
author_sort Lin, Guan Ning
collection PubMed
description BACKGROUND: Protein folding rate is an important property of a protein. Predicting protein folding rate is useful for understanding protein folding process and guiding protein design. Most previous methods of predicting protein folding rate require the tertiary structure of a protein as an input. And most methods do not distinguish the different kinetic nature (two-state folding or multi-state folding) of the proteins. Here we developed a method, SeqRate, to predict both protein folding kinetic type (two-state versus multi-state) and real-value folding rate using sequence length, amino acid composition, contact order, contact number, and secondary structure information predicted from only protein sequence with support vector machines. RESULTS: We systematically studied the contributions of individual features to folding rate prediction. On a standard benchmark dataset, the accuracy of folding kinetic type classification is 80%. The Pearson correlation coefficient and the mean absolute difference between predicted and experimental folding rates (sec(-1)) in the base-10 logarithmic scale are 0.81 and 0.79 for two-state protein folders, and 0.80 and 0.68 for three-state protein folders. SeqRate is the first sequence-based method for protein folding type classification and its accuracy of fold rate prediction is improved over previous sequence-based methods. Its performance can be further enhanced with additional information, such as structure-based geometric contacts, as inputs. CONCLUSIONS: Both the web server and software of predicting folding rate are publicly available at http://casp.rnet.missouri.edu/fold_rate/index.html.
format Text
id pubmed-2863059
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28630592010-05-04 SeqRate: sequence-based protein folding type classification and rates prediction Lin, Guan Ning Wang, Zheng Xu, Dong Cheng, Jianlin BMC Bioinformatics Proceedings BACKGROUND: Protein folding rate is an important property of a protein. Predicting protein folding rate is useful for understanding protein folding process and guiding protein design. Most previous methods of predicting protein folding rate require the tertiary structure of a protein as an input. And most methods do not distinguish the different kinetic nature (two-state folding or multi-state folding) of the proteins. Here we developed a method, SeqRate, to predict both protein folding kinetic type (two-state versus multi-state) and real-value folding rate using sequence length, amino acid composition, contact order, contact number, and secondary structure information predicted from only protein sequence with support vector machines. RESULTS: We systematically studied the contributions of individual features to folding rate prediction. On a standard benchmark dataset, the accuracy of folding kinetic type classification is 80%. The Pearson correlation coefficient and the mean absolute difference between predicted and experimental folding rates (sec(-1)) in the base-10 logarithmic scale are 0.81 and 0.79 for two-state protein folders, and 0.80 and 0.68 for three-state protein folders. SeqRate is the first sequence-based method for protein folding type classification and its accuracy of fold rate prediction is improved over previous sequence-based methods. Its performance can be further enhanced with additional information, such as structure-based geometric contacts, as inputs. CONCLUSIONS: Both the web server and software of predicting folding rate are publicly available at http://casp.rnet.missouri.edu/fold_rate/index.html. BioMed Central 2010-04-29 /pmc/articles/PMC2863059/ /pubmed/20438647 http://dx.doi.org/10.1186/1471-2105-11-S3-S1 Text en Copyright ©2010 Cheng et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Lin, Guan Ning
Wang, Zheng
Xu, Dong
Cheng, Jianlin
SeqRate: sequence-based protein folding type classification and rates prediction
title SeqRate: sequence-based protein folding type classification and rates prediction
title_full SeqRate: sequence-based protein folding type classification and rates prediction
title_fullStr SeqRate: sequence-based protein folding type classification and rates prediction
title_full_unstemmed SeqRate: sequence-based protein folding type classification and rates prediction
title_short SeqRate: sequence-based protein folding type classification and rates prediction
title_sort seqrate: sequence-based protein folding type classification and rates prediction
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2863059/
https://www.ncbi.nlm.nih.gov/pubmed/20438647
http://dx.doi.org/10.1186/1471-2105-11-S3-S1
work_keys_str_mv AT linguanning seqratesequencebasedproteinfoldingtypeclassificationandratesprediction
AT wangzheng seqratesequencebasedproteinfoldingtypeclassificationandratesprediction
AT xudong seqratesequencebasedproteinfoldingtypeclassificationandratesprediction
AT chengjianlin seqratesequencebasedproteinfoldingtypeclassificationandratesprediction