Cargando…

How reliably can we predict the reliability of protein structure predictions?

BACKGROUND: Comparative methods have been the standard techniques for in silico protein structure prediction. The prediction is based on a multiple alignment that contains both reference sequences with known structures and the sequence whose unknown structure is predicted. Intensive research has bee...

Descripción completa

Detalles Bibliográficos
Autores principales: Miklós, István, Novák ', Ádám, Dombai, Balázs, Hein, Jotun
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2324098/
https://www.ncbi.nlm.nih.gov/pubmed/18315874
http://dx.doi.org/10.1186/1471-2105-9-137
_version_ 1782152717882359808
author Miklós, István
Novák ', Ádám
Dombai, Balázs
Hein, Jotun
author_facet Miklós, István
Novák ', Ádám
Dombai, Balázs
Hein, Jotun
author_sort Miklós, István
collection PubMed
description BACKGROUND: Comparative methods have been the standard techniques for in silico protein structure prediction. The prediction is based on a multiple alignment that contains both reference sequences with known structures and the sequence whose unknown structure is predicted. Intensive research has been made to improve the quality of multiple alignments, since misaligned parts of the multiple alignment yield misleading predictions. However, sometimes all methods fail to predict the correct alignment, because the evolutionary signal is too weak to find the homologous parts due to the large number of mutations that separate the sequences. RESULTS: Stochastic sequence alignment methods define a posterior distribution of possible multiple alignments. They can highlight the most likely alignment, and above that, they can give posterior probabilities for each alignment column. We made a comprehensive study on the HOMSTRAD database of structural alignments, predicting secondary structures in four different ways. We showed that alignment posterior probabilities correlate with the reliability of secondary structure predictions, though the strength of the correlation is different for different protocols. The correspondence between the reliability of secondary structure predictions and alignment posterior probabilities is the closest to the identity function when the secondary structure posterior probabilities are calculated from the posterior distribution of multiple alignments. The largest deviation from the identity function has been obtained in the case of predicting secondary structures from a single optimal pairwise alignment. We also showed that alignment posterior probabilities correlate with the 3D distances between C(α )amino acids in superimposed tertiary structures. CONCLUSION: Alignment posterior probabilities can be used to a priori detect errors in comparative models on the sequence alignment level.
format Text
id pubmed-2324098
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23240982008-04-22 How reliably can we predict the reliability of protein structure predictions? Miklós, István Novák ', Ádám Dombai, Balázs Hein, Jotun BMC Bioinformatics Methodology Article BACKGROUND: Comparative methods have been the standard techniques for in silico protein structure prediction. The prediction is based on a multiple alignment that contains both reference sequences with known structures and the sequence whose unknown structure is predicted. Intensive research has been made to improve the quality of multiple alignments, since misaligned parts of the multiple alignment yield misleading predictions. However, sometimes all methods fail to predict the correct alignment, because the evolutionary signal is too weak to find the homologous parts due to the large number of mutations that separate the sequences. RESULTS: Stochastic sequence alignment methods define a posterior distribution of possible multiple alignments. They can highlight the most likely alignment, and above that, they can give posterior probabilities for each alignment column. We made a comprehensive study on the HOMSTRAD database of structural alignments, predicting secondary structures in four different ways. We showed that alignment posterior probabilities correlate with the reliability of secondary structure predictions, though the strength of the correlation is different for different protocols. The correspondence between the reliability of secondary structure predictions and alignment posterior probabilities is the closest to the identity function when the secondary structure posterior probabilities are calculated from the posterior distribution of multiple alignments. The largest deviation from the identity function has been obtained in the case of predicting secondary structures from a single optimal pairwise alignment. We also showed that alignment posterior probabilities correlate with the 3D distances between C(α )amino acids in superimposed tertiary structures. CONCLUSION: Alignment posterior probabilities can be used to a priori detect errors in comparative models on the sequence alignment level. BioMed Central 2008-03-03 /pmc/articles/PMC2324098/ /pubmed/18315874 http://dx.doi.org/10.1186/1471-2105-9-137 Text en Copyright © 2008 Miklós et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Miklós, István
Novák ', Ádám
Dombai, Balázs
Hein, Jotun
How reliably can we predict the reliability of protein structure predictions?
title How reliably can we predict the reliability of protein structure predictions?
title_full How reliably can we predict the reliability of protein structure predictions?
title_fullStr How reliably can we predict the reliability of protein structure predictions?
title_full_unstemmed How reliably can we predict the reliability of protein structure predictions?
title_short How reliably can we predict the reliability of protein structure predictions?
title_sort how reliably can we predict the reliability of protein structure predictions?
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2324098/
https://www.ncbi.nlm.nih.gov/pubmed/18315874
http://dx.doi.org/10.1186/1471-2105-9-137
work_keys_str_mv AT miklosistvan howreliablycanwepredictthereliabilityofproteinstructurepredictions
AT novakadam howreliablycanwepredictthereliabilityofproteinstructurepredictions
AT dombaibalazs howreliablycanwepredictthereliabilityofproteinstructurepredictions
AT heinjotun howreliablycanwepredictthereliabilityofproteinstructurepredictions