Cargando…
SeqRate: sequence-based protein folding type classification and rates prediction
BACKGROUND: Protein folding rate is an important property of a protein. Predicting protein folding rate is useful for understanding protein folding process and guiding protein design. Most previous methods of predicting protein folding rate require the tertiary structure of a protein as an input. An...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2863059/ https://www.ncbi.nlm.nih.gov/pubmed/20438647 http://dx.doi.org/10.1186/1471-2105-11-S3-S1 |
_version_ | 1782180741157748736 |
---|---|
author | Lin, Guan Ning Wang, Zheng Xu, Dong Cheng, Jianlin |
author_facet | Lin, Guan Ning Wang, Zheng Xu, Dong Cheng, Jianlin |
author_sort | Lin, Guan Ning |
collection | PubMed |
description | BACKGROUND: Protein folding rate is an important property of a protein. Predicting protein folding rate is useful for understanding protein folding process and guiding protein design. Most previous methods of predicting protein folding rate require the tertiary structure of a protein as an input. And most methods do not distinguish the different kinetic nature (two-state folding or multi-state folding) of the proteins. Here we developed a method, SeqRate, to predict both protein folding kinetic type (two-state versus multi-state) and real-value folding rate using sequence length, amino acid composition, contact order, contact number, and secondary structure information predicted from only protein sequence with support vector machines. RESULTS: We systematically studied the contributions of individual features to folding rate prediction. On a standard benchmark dataset, the accuracy of folding kinetic type classification is 80%. The Pearson correlation coefficient and the mean absolute difference between predicted and experimental folding rates (sec(-1)) in the base-10 logarithmic scale are 0.81 and 0.79 for two-state protein folders, and 0.80 and 0.68 for three-state protein folders. SeqRate is the first sequence-based method for protein folding type classification and its accuracy of fold rate prediction is improved over previous sequence-based methods. Its performance can be further enhanced with additional information, such as structure-based geometric contacts, as inputs. CONCLUSIONS: Both the web server and software of predicting folding rate are publicly available at http://casp.rnet.missouri.edu/fold_rate/index.html. |
format | Text |
id | pubmed-2863059 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-28630592010-05-04 SeqRate: sequence-based protein folding type classification and rates prediction Lin, Guan Ning Wang, Zheng Xu, Dong Cheng, Jianlin BMC Bioinformatics Proceedings BACKGROUND: Protein folding rate is an important property of a protein. Predicting protein folding rate is useful for understanding protein folding process and guiding protein design. Most previous methods of predicting protein folding rate require the tertiary structure of a protein as an input. And most methods do not distinguish the different kinetic nature (two-state folding or multi-state folding) of the proteins. Here we developed a method, SeqRate, to predict both protein folding kinetic type (two-state versus multi-state) and real-value folding rate using sequence length, amino acid composition, contact order, contact number, and secondary structure information predicted from only protein sequence with support vector machines. RESULTS: We systematically studied the contributions of individual features to folding rate prediction. On a standard benchmark dataset, the accuracy of folding kinetic type classification is 80%. The Pearson correlation coefficient and the mean absolute difference between predicted and experimental folding rates (sec(-1)) in the base-10 logarithmic scale are 0.81 and 0.79 for two-state protein folders, and 0.80 and 0.68 for three-state protein folders. SeqRate is the first sequence-based method for protein folding type classification and its accuracy of fold rate prediction is improved over previous sequence-based methods. Its performance can be further enhanced with additional information, such as structure-based geometric contacts, as inputs. CONCLUSIONS: Both the web server and software of predicting folding rate are publicly available at http://casp.rnet.missouri.edu/fold_rate/index.html. BioMed Central 2010-04-29 /pmc/articles/PMC2863059/ /pubmed/20438647 http://dx.doi.org/10.1186/1471-2105-11-S3-S1 Text en Copyright ©2010 Cheng et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Lin, Guan Ning Wang, Zheng Xu, Dong Cheng, Jianlin SeqRate: sequence-based protein folding type classification and rates prediction |
title | SeqRate: sequence-based protein folding type classification and rates prediction |
title_full | SeqRate: sequence-based protein folding type classification and rates prediction |
title_fullStr | SeqRate: sequence-based protein folding type classification and rates prediction |
title_full_unstemmed | SeqRate: sequence-based protein folding type classification and rates prediction |
title_short | SeqRate: sequence-based protein folding type classification and rates prediction |
title_sort | seqrate: sequence-based protein folding type classification and rates prediction |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2863059/ https://www.ncbi.nlm.nih.gov/pubmed/20438647 http://dx.doi.org/10.1186/1471-2105-11-S3-S1 |
work_keys_str_mv | AT linguanning seqratesequencebasedproteinfoldingtypeclassificationandratesprediction AT wangzheng seqratesequencebasedproteinfoldingtypeclassificationandratesprediction AT xudong seqratesequencebasedproteinfoldingtypeclassificationandratesprediction AT chengjianlin seqratesequencebasedproteinfoldingtypeclassificationandratesprediction |