Cargando…
Sequence feature-based prediction of protein stability changes upon amino acid substitutions
BACKGROUND: Protein destabilization is a common mechanism by which amino acid substitutions cause human diseases. Although several machine learning methods have been reported for predicting protein stability changes upon amino acid substitutions, the previous studies did not utilize relevant sequenc...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2975416/ https://www.ncbi.nlm.nih.gov/pubmed/21047386 http://dx.doi.org/10.1186/1471-2164-11-S2-S5 |
_version_ | 1782190944286670848 |
---|---|
author | Teng, Shaolei Srivastava, Anand K Wang, Liangjiang |
author_facet | Teng, Shaolei Srivastava, Anand K Wang, Liangjiang |
author_sort | Teng, Shaolei |
collection | PubMed |
description | BACKGROUND: Protein destabilization is a common mechanism by which amino acid substitutions cause human diseases. Although several machine learning methods have been reported for predicting protein stability changes upon amino acid substitutions, the previous studies did not utilize relevant sequence features representing biological knowledge for classifier construction. RESULTS: In this study, a new machine learning method has been developed for sequence feature-based prediction of protein stability changes upon amino acid substitutions. Support vector machines were trained with data from experimental studies on the free energy change of protein stability upon mutations. To construct accurate classifiers, twenty sequence features were examined for input vector encoding. It was shown that classifier performance varied significantly by using different sequence features. The most accurate classifier in this study was constructed using a combination of six sequence features. This classifier achieved an overall accuracy of 84.59% with 70.29% sensitivity and 90.98% specificity. CONCLUSIONS: Relevant sequence features can be used to accurately predict protein stability changes upon amino acid substitutions. Predictive results at this level of accuracy may provide useful information to distinguish between deleterious and tolerant alterations in disease candidate genes. To make the classifier accessible to the genetics research community, we have developed a new web server, called MuStab (http://bioinfo.ggc.org/mustab/). |
format | Text |
id | pubmed-2975416 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-29754162010-11-09 Sequence feature-based prediction of protein stability changes upon amino acid substitutions Teng, Shaolei Srivastava, Anand K Wang, Liangjiang BMC Genomics Research BACKGROUND: Protein destabilization is a common mechanism by which amino acid substitutions cause human diseases. Although several machine learning methods have been reported for predicting protein stability changes upon amino acid substitutions, the previous studies did not utilize relevant sequence features representing biological knowledge for classifier construction. RESULTS: In this study, a new machine learning method has been developed for sequence feature-based prediction of protein stability changes upon amino acid substitutions. Support vector machines were trained with data from experimental studies on the free energy change of protein stability upon mutations. To construct accurate classifiers, twenty sequence features were examined for input vector encoding. It was shown that classifier performance varied significantly by using different sequence features. The most accurate classifier in this study was constructed using a combination of six sequence features. This classifier achieved an overall accuracy of 84.59% with 70.29% sensitivity and 90.98% specificity. CONCLUSIONS: Relevant sequence features can be used to accurately predict protein stability changes upon amino acid substitutions. Predictive results at this level of accuracy may provide useful information to distinguish between deleterious and tolerant alterations in disease candidate genes. To make the classifier accessible to the genetics research community, we have developed a new web server, called MuStab (http://bioinfo.ggc.org/mustab/). BioMed Central 2010-11-02 /pmc/articles/PMC2975416/ /pubmed/21047386 http://dx.doi.org/10.1186/1471-2164-11-S2-S5 Text en Copyright ©2010 Wang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Teng, Shaolei Srivastava, Anand K Wang, Liangjiang Sequence feature-based prediction of protein stability changes upon amino acid substitutions |
title | Sequence feature-based prediction of protein stability changes upon amino acid substitutions |
title_full | Sequence feature-based prediction of protein stability changes upon amino acid substitutions |
title_fullStr | Sequence feature-based prediction of protein stability changes upon amino acid substitutions |
title_full_unstemmed | Sequence feature-based prediction of protein stability changes upon amino acid substitutions |
title_short | Sequence feature-based prediction of protein stability changes upon amino acid substitutions |
title_sort | sequence feature-based prediction of protein stability changes upon amino acid substitutions |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2975416/ https://www.ncbi.nlm.nih.gov/pubmed/21047386 http://dx.doi.org/10.1186/1471-2164-11-S2-S5 |
work_keys_str_mv | AT tengshaolei sequencefeaturebasedpredictionofproteinstabilitychangesuponaminoacidsubstitutions AT srivastavaanandk sequencefeaturebasedpredictionofproteinstabilitychangesuponaminoacidsubstitutions AT wangliangjiang sequencefeaturebasedpredictionofproteinstabilitychangesuponaminoacidsubstitutions |