Cargando…
Random Forest-Based Protein Model Quality Assessment (RFMQA) Using Structural Features and Potential Energy Terms
Recently, predicting proteins three-dimensional (3D) structure from its sequence information has made a significant progress due to the advances in computational techniques and the growth of experimental structures. However, selecting good models from a structural model pool is an important and chal...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4164442/ https://www.ncbi.nlm.nih.gov/pubmed/25222008 http://dx.doi.org/10.1371/journal.pone.0106542 |
_version_ | 1782334956413911040 |
---|---|
author | Manavalan, Balachandran Lee, Juyong Lee, Jooyoung |
author_facet | Manavalan, Balachandran Lee, Juyong Lee, Jooyoung |
author_sort | Manavalan, Balachandran |
collection | PubMed |
description | Recently, predicting proteins three-dimensional (3D) structure from its sequence information has made a significant progress due to the advances in computational techniques and the growth of experimental structures. However, selecting good models from a structural model pool is an important and challenging task in protein structure prediction. In this study, we present the first application of random forest based model quality assessment (RFMQA) to rank protein models using its structural features and knowledge-based potential energy terms. The method predicts a relative score of a model by using its secondary structure, solvent accessibility and knowledge-based potential energy terms. We trained and tested the RFMQA method on CASP8 and CASP9 targets using 5-fold cross-validation. The correlation coefficient between the TM-score of the model selected by RFMQA (TM(RF)) and the best server model (TM(best)) is 0.945. We benchmarked our method on recent CASP10 targets by using CASP8 and 9 server models as a training set. The correlation coefficient and average difference between TM(RF) and TM(best) over 95 CASP10 targets are 0.984 and 0.0385, respectively. The test results show that our method works better in selecting top models when compared with other top performing methods. RFMQA is available for download from http://lee.kias.re.kr/RFMQA/RFMQA_eval.tar.gz. |
format | Online Article Text |
id | pubmed-4164442 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-41644422014-09-19 Random Forest-Based Protein Model Quality Assessment (RFMQA) Using Structural Features and Potential Energy Terms Manavalan, Balachandran Lee, Juyong Lee, Jooyoung PLoS One Research Article Recently, predicting proteins three-dimensional (3D) structure from its sequence information has made a significant progress due to the advances in computational techniques and the growth of experimental structures. However, selecting good models from a structural model pool is an important and challenging task in protein structure prediction. In this study, we present the first application of random forest based model quality assessment (RFMQA) to rank protein models using its structural features and knowledge-based potential energy terms. The method predicts a relative score of a model by using its secondary structure, solvent accessibility and knowledge-based potential energy terms. We trained and tested the RFMQA method on CASP8 and CASP9 targets using 5-fold cross-validation. The correlation coefficient between the TM-score of the model selected by RFMQA (TM(RF)) and the best server model (TM(best)) is 0.945. We benchmarked our method on recent CASP10 targets by using CASP8 and 9 server models as a training set. The correlation coefficient and average difference between TM(RF) and TM(best) over 95 CASP10 targets are 0.984 and 0.0385, respectively. The test results show that our method works better in selecting top models when compared with other top performing methods. RFMQA is available for download from http://lee.kias.re.kr/RFMQA/RFMQA_eval.tar.gz. Public Library of Science 2014-09-15 /pmc/articles/PMC4164442/ /pubmed/25222008 http://dx.doi.org/10.1371/journal.pone.0106542 Text en © 2014 Manavalan et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Manavalan, Balachandran Lee, Juyong Lee, Jooyoung Random Forest-Based Protein Model Quality Assessment (RFMQA) Using Structural Features and Potential Energy Terms |
title | Random Forest-Based Protein Model Quality Assessment (RFMQA) Using Structural Features and Potential Energy Terms |
title_full | Random Forest-Based Protein Model Quality Assessment (RFMQA) Using Structural Features and Potential Energy Terms |
title_fullStr | Random Forest-Based Protein Model Quality Assessment (RFMQA) Using Structural Features and Potential Energy Terms |
title_full_unstemmed | Random Forest-Based Protein Model Quality Assessment (RFMQA) Using Structural Features and Potential Energy Terms |
title_short | Random Forest-Based Protein Model Quality Assessment (RFMQA) Using Structural Features and Potential Energy Terms |
title_sort | random forest-based protein model quality assessment (rfmqa) using structural features and potential energy terms |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4164442/ https://www.ncbi.nlm.nih.gov/pubmed/25222008 http://dx.doi.org/10.1371/journal.pone.0106542 |
work_keys_str_mv | AT manavalanbalachandran randomforestbasedproteinmodelqualityassessmentrfmqausingstructuralfeaturesandpotentialenergyterms AT leejuyong randomforestbasedproteinmodelqualityassessmentrfmqausingstructuralfeaturesandpotentialenergyterms AT leejooyoung randomforestbasedproteinmodelqualityassessmentrfmqausingstructuralfeaturesandpotentialenergyterms |