Cargando…
The Significance of the ProtDeform Score for Structure Prediction and Alignment
BACKGROUND: When a researcher uses a program to align two proteins and gets a score, one of her main concerns is how often the program gives a similar score to pairs that are or are not in the same fold. This issue was analysed in detail recently for the program TM-align with its associated TM-score...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3125161/ https://www.ncbi.nlm.nih.gov/pubmed/21738592 http://dx.doi.org/10.1371/journal.pone.0020889 |
_version_ | 1782207170699329536 |
---|---|
author | Rocha, Jairo Alberich, Ricardo |
author_facet | Rocha, Jairo Alberich, Ricardo |
author_sort | Rocha, Jairo |
collection | PubMed |
description | BACKGROUND: When a researcher uses a program to align two proteins and gets a score, one of her main concerns is how often the program gives a similar score to pairs that are or are not in the same fold. This issue was analysed in detail recently for the program TM-align with its associated TM-score. It was shown that because the TM-score is length independent, it allows a P-value and a hit probability to be defined depending only on the score. Also, it was found that the TM-scores of gapless alignments closely follow an Extreme Value Distribution (EVD). The program ProtDeform for structural protein alignment was developed recently and is characterised by the ability to propose different transformations of different protein regions. Our goal is to analyse its associated score to allow a researcher to have objective reasons to prefer one aligner over another, and carry out a better interpretation of the output. RESULTS: The study on the ProtDeform score reveals that it is length independent in a wider score range than TM-scores and that PD-scores of gapless (random) alignments also approximately follow an EVD. On the CASP8 predictions, PD-scores and TM-scores, with respect to native structures, are highly correlated (0.95), and show that around a fifth of the predictions have a quality as low as 99.5% of the random scores. Using the Gold Standard benchmark, ProtDeform has lower probabilities of error than TM-align both at a similar speed. The analysis is extended to homology discrimination showing that, again, ProtDeform offers higher hit probabilities than TM-align. Finally, we suggest using three different P-values according to the three different contexts: Gapless alignments, optimised alignments for fold discrimination and that for superfamily discrimination. In conclusion, PD-scores are at the very least as valuable for prediction scoring as TM-scores, and on the protein classification problem, even more reliable. |
format | Online Article Text |
id | pubmed-3125161 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-31251612011-07-07 The Significance of the ProtDeform Score for Structure Prediction and Alignment Rocha, Jairo Alberich, Ricardo PLoS One Research Article BACKGROUND: When a researcher uses a program to align two proteins and gets a score, one of her main concerns is how often the program gives a similar score to pairs that are or are not in the same fold. This issue was analysed in detail recently for the program TM-align with its associated TM-score. It was shown that because the TM-score is length independent, it allows a P-value and a hit probability to be defined depending only on the score. Also, it was found that the TM-scores of gapless alignments closely follow an Extreme Value Distribution (EVD). The program ProtDeform for structural protein alignment was developed recently and is characterised by the ability to propose different transformations of different protein regions. Our goal is to analyse its associated score to allow a researcher to have objective reasons to prefer one aligner over another, and carry out a better interpretation of the output. RESULTS: The study on the ProtDeform score reveals that it is length independent in a wider score range than TM-scores and that PD-scores of gapless (random) alignments also approximately follow an EVD. On the CASP8 predictions, PD-scores and TM-scores, with respect to native structures, are highly correlated (0.95), and show that around a fifth of the predictions have a quality as low as 99.5% of the random scores. Using the Gold Standard benchmark, ProtDeform has lower probabilities of error than TM-align both at a similar speed. The analysis is extended to homology discrimination showing that, again, ProtDeform offers higher hit probabilities than TM-align. Finally, we suggest using three different P-values according to the three different contexts: Gapless alignments, optimised alignments for fold discrimination and that for superfamily discrimination. In conclusion, PD-scores are at the very least as valuable for prediction scoring as TM-scores, and on the protein classification problem, even more reliable. Public Library of Science 2011-06-28 /pmc/articles/PMC3125161/ /pubmed/21738592 http://dx.doi.org/10.1371/journal.pone.0020889 Text en Rocha, Alberich. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Rocha, Jairo Alberich, Ricardo The Significance of the ProtDeform Score for Structure Prediction and Alignment |
title | The Significance of the ProtDeform Score for Structure Prediction and Alignment |
title_full | The Significance of the ProtDeform Score for Structure Prediction and Alignment |
title_fullStr | The Significance of the ProtDeform Score for Structure Prediction and Alignment |
title_full_unstemmed | The Significance of the ProtDeform Score for Structure Prediction and Alignment |
title_short | The Significance of the ProtDeform Score for Structure Prediction and Alignment |
title_sort | significance of the protdeform score for structure prediction and alignment |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3125161/ https://www.ncbi.nlm.nih.gov/pubmed/21738592 http://dx.doi.org/10.1371/journal.pone.0020889 |
work_keys_str_mv | AT rochajairo thesignificanceoftheprotdeformscoreforstructurepredictionandalignment AT alberichricardo thesignificanceoftheprotdeformscoreforstructurepredictionandalignment AT rochajairo significanceoftheprotdeformscoreforstructurepredictionandalignment AT alberichricardo significanceoftheprotdeformscoreforstructurepredictionandalignment |