Cargando…

Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks

BACKGROUND: Protein inter-residue contact maps provide a translation and rotation invariant topological representation of a protein. They can be used as an intermediary step in protein structure predictions. However, the prediction of contact maps represents an unbalanced problem as far fewer exampl...

Descripción completa

Detalles Bibliográficos
Autores principales: Kukic, Predrag, Mirabello, Claudio, Tradigo, Giuseppe, Walsh, Ian, Veltri, Pierangelo, Pollastri, Gianluca
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3893389/
https://www.ncbi.nlm.nih.gov/pubmed/24410833
http://dx.doi.org/10.1186/1471-2105-15-6
_version_ 1782299676887744512
author Kukic, Predrag
Mirabello, Claudio
Tradigo, Giuseppe
Walsh, Ian
Veltri, Pierangelo
Pollastri, Gianluca
author_facet Kukic, Predrag
Mirabello, Claudio
Tradigo, Giuseppe
Walsh, Ian
Veltri, Pierangelo
Pollastri, Gianluca
author_sort Kukic, Predrag
collection PubMed
description BACKGROUND: Protein inter-residue contact maps provide a translation and rotation invariant topological representation of a protein. They can be used as an intermediary step in protein structure predictions. However, the prediction of contact maps represents an unbalanced problem as far fewer examples of contacts than non-contacts exist in a protein structure. In this study we explore the possibility of completely eliminating the unbalanced nature of the contact map prediction problem by predicting real-value distances between residues. Predicting full inter-residue distance maps and applying them in protein structure predictions has been relatively unexplored in the past. RESULTS: We initially demonstrate that the use of native-like distance maps is able to reproduce 3D structures almost identical to the targets, giving an average RMSD of 0.5Å. In addition, the corrupted physical maps with an introduced random error of ±6Å are able to reconstruct the targets within an average RMSD of 2Å. After demonstrating the reconstruction potential of distance maps, we develop two classes of predictors using two-dimensional recursive neural networks: an ab initio predictor that relies only on the protein sequence and evolutionary information, and a template-based predictor in which additional structural homology information is provided. We find that the ab initio predictor is able to reproduce distances with an RMSD of 6Å, regardless of the evolutionary content provided. Furthermore, we show that the template-based predictor exploits both sequence and structure information even in cases of dubious homology and outperforms the best template hit with a clear margin of up to 3.7Å. Lastly, we demonstrate the ability of the two predictors to reconstruct the CASP9 targets shorter than 200 residues producing the results similar to the state of the machine learning art approach implemented in the Distill server. CONCLUSIONS: The methodology presented here, if complemented by more complex reconstruction protocols, can represent a possible path to improve machine learning algorithms for 3D protein structure prediction. Moreover, it can be used as an intermediary step in protein structure predictions either on its own or complemented by NMR restraints.
format Online
Article
Text
id pubmed-3893389
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38933892014-01-27 Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks Kukic, Predrag Mirabello, Claudio Tradigo, Giuseppe Walsh, Ian Veltri, Pierangelo Pollastri, Gianluca BMC Bioinformatics Research Article BACKGROUND: Protein inter-residue contact maps provide a translation and rotation invariant topological representation of a protein. They can be used as an intermediary step in protein structure predictions. However, the prediction of contact maps represents an unbalanced problem as far fewer examples of contacts than non-contacts exist in a protein structure. In this study we explore the possibility of completely eliminating the unbalanced nature of the contact map prediction problem by predicting real-value distances between residues. Predicting full inter-residue distance maps and applying them in protein structure predictions has been relatively unexplored in the past. RESULTS: We initially demonstrate that the use of native-like distance maps is able to reproduce 3D structures almost identical to the targets, giving an average RMSD of 0.5Å. In addition, the corrupted physical maps with an introduced random error of ±6Å are able to reconstruct the targets within an average RMSD of 2Å. After demonstrating the reconstruction potential of distance maps, we develop two classes of predictors using two-dimensional recursive neural networks: an ab initio predictor that relies only on the protein sequence and evolutionary information, and a template-based predictor in which additional structural homology information is provided. We find that the ab initio predictor is able to reproduce distances with an RMSD of 6Å, regardless of the evolutionary content provided. Furthermore, we show that the template-based predictor exploits both sequence and structure information even in cases of dubious homology and outperforms the best template hit with a clear margin of up to 3.7Å. Lastly, we demonstrate the ability of the two predictors to reconstruct the CASP9 targets shorter than 200 residues producing the results similar to the state of the machine learning art approach implemented in the Distill server. CONCLUSIONS: The methodology presented here, if complemented by more complex reconstruction protocols, can represent a possible path to improve machine learning algorithms for 3D protein structure prediction. Moreover, it can be used as an intermediary step in protein structure predictions either on its own or complemented by NMR restraints. BioMed Central 2014-01-10 /pmc/articles/PMC3893389/ /pubmed/24410833 http://dx.doi.org/10.1186/1471-2105-15-6 Text en Copyright © 2014 Kukic et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Kukic, Predrag
Mirabello, Claudio
Tradigo, Giuseppe
Walsh, Ian
Veltri, Pierangelo
Pollastri, Gianluca
Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks
title Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks
title_full Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks
title_fullStr Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks
title_full_unstemmed Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks
title_short Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks
title_sort toward an accurate prediction of inter-residue distances in proteins using 2d recursive neural networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3893389/
https://www.ncbi.nlm.nih.gov/pubmed/24410833
http://dx.doi.org/10.1186/1471-2105-15-6
work_keys_str_mv AT kukicpredrag towardanaccuratepredictionofinterresiduedistancesinproteinsusing2drecursiveneuralnetworks
AT mirabelloclaudio towardanaccuratepredictionofinterresiduedistancesinproteinsusing2drecursiveneuralnetworks
AT tradigogiuseppe towardanaccuratepredictionofinterresiduedistancesinproteinsusing2drecursiveneuralnetworks
AT walshian towardanaccuratepredictionofinterresiduedistancesinproteinsusing2drecursiveneuralnetworks
AT veltripierangelo towardanaccuratepredictionofinterresiduedistancesinproteinsusing2drecursiveneuralnetworks
AT pollastrigianluca towardanaccuratepredictionofinterresiduedistancesinproteinsusing2drecursiveneuralnetworks