Cargando…
Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks
BACKGROUND: Protein inter-residue contact maps provide a translation and rotation invariant topological representation of a protein. They can be used as an intermediary step in protein structure predictions. However, the prediction of contact maps represents an unbalanced problem as far fewer exampl...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3893389/ https://www.ncbi.nlm.nih.gov/pubmed/24410833 http://dx.doi.org/10.1186/1471-2105-15-6 |
_version_ | 1782299676887744512 |
---|---|
author | Kukic, Predrag Mirabello, Claudio Tradigo, Giuseppe Walsh, Ian Veltri, Pierangelo Pollastri, Gianluca |
author_facet | Kukic, Predrag Mirabello, Claudio Tradigo, Giuseppe Walsh, Ian Veltri, Pierangelo Pollastri, Gianluca |
author_sort | Kukic, Predrag |
collection | PubMed |
description | BACKGROUND: Protein inter-residue contact maps provide a translation and rotation invariant topological representation of a protein. They can be used as an intermediary step in protein structure predictions. However, the prediction of contact maps represents an unbalanced problem as far fewer examples of contacts than non-contacts exist in a protein structure. In this study we explore the possibility of completely eliminating the unbalanced nature of the contact map prediction problem by predicting real-value distances between residues. Predicting full inter-residue distance maps and applying them in protein structure predictions has been relatively unexplored in the past. RESULTS: We initially demonstrate that the use of native-like distance maps is able to reproduce 3D structures almost identical to the targets, giving an average RMSD of 0.5Å. In addition, the corrupted physical maps with an introduced random error of ±6Å are able to reconstruct the targets within an average RMSD of 2Å. After demonstrating the reconstruction potential of distance maps, we develop two classes of predictors using two-dimensional recursive neural networks: an ab initio predictor that relies only on the protein sequence and evolutionary information, and a template-based predictor in which additional structural homology information is provided. We find that the ab initio predictor is able to reproduce distances with an RMSD of 6Å, regardless of the evolutionary content provided. Furthermore, we show that the template-based predictor exploits both sequence and structure information even in cases of dubious homology and outperforms the best template hit with a clear margin of up to 3.7Å. Lastly, we demonstrate the ability of the two predictors to reconstruct the CASP9 targets shorter than 200 residues producing the results similar to the state of the machine learning art approach implemented in the Distill server. CONCLUSIONS: The methodology presented here, if complemented by more complex reconstruction protocols, can represent a possible path to improve machine learning algorithms for 3D protein structure prediction. Moreover, it can be used as an intermediary step in protein structure predictions either on its own or complemented by NMR restraints. |
format | Online Article Text |
id | pubmed-3893389 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-38933892014-01-27 Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks Kukic, Predrag Mirabello, Claudio Tradigo, Giuseppe Walsh, Ian Veltri, Pierangelo Pollastri, Gianluca BMC Bioinformatics Research Article BACKGROUND: Protein inter-residue contact maps provide a translation and rotation invariant topological representation of a protein. They can be used as an intermediary step in protein structure predictions. However, the prediction of contact maps represents an unbalanced problem as far fewer examples of contacts than non-contacts exist in a protein structure. In this study we explore the possibility of completely eliminating the unbalanced nature of the contact map prediction problem by predicting real-value distances between residues. Predicting full inter-residue distance maps and applying them in protein structure predictions has been relatively unexplored in the past. RESULTS: We initially demonstrate that the use of native-like distance maps is able to reproduce 3D structures almost identical to the targets, giving an average RMSD of 0.5Å. In addition, the corrupted physical maps with an introduced random error of ±6Å are able to reconstruct the targets within an average RMSD of 2Å. After demonstrating the reconstruction potential of distance maps, we develop two classes of predictors using two-dimensional recursive neural networks: an ab initio predictor that relies only on the protein sequence and evolutionary information, and a template-based predictor in which additional structural homology information is provided. We find that the ab initio predictor is able to reproduce distances with an RMSD of 6Å, regardless of the evolutionary content provided. Furthermore, we show that the template-based predictor exploits both sequence and structure information even in cases of dubious homology and outperforms the best template hit with a clear margin of up to 3.7Å. Lastly, we demonstrate the ability of the two predictors to reconstruct the CASP9 targets shorter than 200 residues producing the results similar to the state of the machine learning art approach implemented in the Distill server. CONCLUSIONS: The methodology presented here, if complemented by more complex reconstruction protocols, can represent a possible path to improve machine learning algorithms for 3D protein structure prediction. Moreover, it can be used as an intermediary step in protein structure predictions either on its own or complemented by NMR restraints. BioMed Central 2014-01-10 /pmc/articles/PMC3893389/ /pubmed/24410833 http://dx.doi.org/10.1186/1471-2105-15-6 Text en Copyright © 2014 Kukic et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Kukic, Predrag Mirabello, Claudio Tradigo, Giuseppe Walsh, Ian Veltri, Pierangelo Pollastri, Gianluca Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks |
title | Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks |
title_full | Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks |
title_fullStr | Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks |
title_full_unstemmed | Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks |
title_short | Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks |
title_sort | toward an accurate prediction of inter-residue distances in proteins using 2d recursive neural networks |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3893389/ https://www.ncbi.nlm.nih.gov/pubmed/24410833 http://dx.doi.org/10.1186/1471-2105-15-6 |
work_keys_str_mv | AT kukicpredrag towardanaccuratepredictionofinterresiduedistancesinproteinsusing2drecursiveneuralnetworks AT mirabelloclaudio towardanaccuratepredictionofinterresiduedistancesinproteinsusing2drecursiveneuralnetworks AT tradigogiuseppe towardanaccuratepredictionofinterresiduedistancesinproteinsusing2drecursiveneuralnetworks AT walshian towardanaccuratepredictionofinterresiduedistancesinproteinsusing2drecursiveneuralnetworks AT veltripierangelo towardanaccuratepredictionofinterresiduedistancesinproteinsusing2drecursiveneuralnetworks AT pollastrigianluca towardanaccuratepredictionofinterresiduedistancesinproteinsusing2drecursiveneuralnetworks |