Cargando…

Improving Contact Prediction along Three Dimensions

Correlation patterns in multiple sequence alignments of homologous proteins can be exploited to infer information on the three-dimensional structure of their members. The typical pipeline to address this task, which we in this paper refer to as the three dimensions of contact prediction, is to (i) f...

Descripción completa

Detalles Bibliográficos
Autores principales: Feinauer, Christoph, Skwark, Marcin J., Pagnani, Andrea, Aurell, Erik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4191875/
https://www.ncbi.nlm.nih.gov/pubmed/25299132
http://dx.doi.org/10.1371/journal.pcbi.1003847
_version_ 1782338718721376256
author Feinauer, Christoph
Skwark, Marcin J.
Pagnani, Andrea
Aurell, Erik
author_facet Feinauer, Christoph
Skwark, Marcin J.
Pagnani, Andrea
Aurell, Erik
author_sort Feinauer, Christoph
collection PubMed
description Correlation patterns in multiple sequence alignments of homologous proteins can be exploited to infer information on the three-dimensional structure of their members. The typical pipeline to address this task, which we in this paper refer to as the three dimensions of contact prediction, is to (i) filter and align the raw sequence data representing the evolutionarily related proteins; (ii) choose a predictive model to describe a sequence alignment; (iii) infer the model parameters and interpret them in terms of structural properties, such as an accurate contact map. We show here that all three dimensions are important for overall prediction success. In particular, we show that it is possible to improve significantly along the second dimension by going beyond the pair-wise Potts models from statistical physics, which have hitherto been the focus of the field. These (simple) extensions are motivated by multiple sequence alignments often containing long stretches of gaps which, as a data feature, would be rather untypical for independent samples drawn from a Potts model. Using a large test set of proteins we show that the combined improvements along the three dimensions are as large as any reported to date.
format Online
Article
Text
id pubmed-4191875
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-41918752014-10-14 Improving Contact Prediction along Three Dimensions Feinauer, Christoph Skwark, Marcin J. Pagnani, Andrea Aurell, Erik PLoS Comput Biol Research Article Correlation patterns in multiple sequence alignments of homologous proteins can be exploited to infer information on the three-dimensional structure of their members. The typical pipeline to address this task, which we in this paper refer to as the three dimensions of contact prediction, is to (i) filter and align the raw sequence data representing the evolutionarily related proteins; (ii) choose a predictive model to describe a sequence alignment; (iii) infer the model parameters and interpret them in terms of structural properties, such as an accurate contact map. We show here that all three dimensions are important for overall prediction success. In particular, we show that it is possible to improve significantly along the second dimension by going beyond the pair-wise Potts models from statistical physics, which have hitherto been the focus of the field. These (simple) extensions are motivated by multiple sequence alignments often containing long stretches of gaps which, as a data feature, would be rather untypical for independent samples drawn from a Potts model. Using a large test set of proteins we show that the combined improvements along the three dimensions are as large as any reported to date. Public Library of Science 2014-10-09 /pmc/articles/PMC4191875/ /pubmed/25299132 http://dx.doi.org/10.1371/journal.pcbi.1003847 Text en © 2014 Feinauer et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Feinauer, Christoph
Skwark, Marcin J.
Pagnani, Andrea
Aurell, Erik
Improving Contact Prediction along Three Dimensions
title Improving Contact Prediction along Three Dimensions
title_full Improving Contact Prediction along Three Dimensions
title_fullStr Improving Contact Prediction along Three Dimensions
title_full_unstemmed Improving Contact Prediction along Three Dimensions
title_short Improving Contact Prediction along Three Dimensions
title_sort improving contact prediction along three dimensions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4191875/
https://www.ncbi.nlm.nih.gov/pubmed/25299132
http://dx.doi.org/10.1371/journal.pcbi.1003847
work_keys_str_mv AT feinauerchristoph improvingcontactpredictionalongthreedimensions
AT skwarkmarcinj improvingcontactpredictionalongthreedimensions
AT pagnaniandrea improvingcontactpredictionalongthreedimensions
AT aurellerik improvingcontactpredictionalongthreedimensions