Cargando…

A Probabilistic Fragment-Based Protein Structure Prediction Algorithm

Conformational sampling is one of the bottlenecks in fragment-based protein structure prediction approaches. They generally start with a coarse-grained optimization where mainchain atoms and centroids of side chains are considered, followed by a fine-grained optimization with an all-atom representat...

Descripción completa

Detalles Bibliográficos
Autores principales: Simoncini, David, Berenger, Francois, Shrestha, Rojan, Zhang, Kam Y. J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3400640/
https://www.ncbi.nlm.nih.gov/pubmed/22829868
http://dx.doi.org/10.1371/journal.pone.0038799
_version_ 1782238517431107584
author Simoncini, David
Berenger, Francois
Shrestha, Rojan
Zhang, Kam Y. J.
author_facet Simoncini, David
Berenger, Francois
Shrestha, Rojan
Zhang, Kam Y. J.
author_sort Simoncini, David
collection PubMed
description Conformational sampling is one of the bottlenecks in fragment-based protein structure prediction approaches. They generally start with a coarse-grained optimization where mainchain atoms and centroids of side chains are considered, followed by a fine-grained optimization with an all-atom representation of proteins. It is during this coarse-grained phase that fragment-based methods sample intensely the conformational space. If the native-like region is sampled more, the accuracy of the final all-atom predictions may be improved accordingly. In this work we present EdaFold, a new method for fragment-based protein structure prediction based on an Estimation of Distribution Algorithm. Fragment-based approaches build protein models by assembling short fragments from known protein structures. Whereas the probability mass functions over the fragment libraries are uniform in the usual case, we propose an algorithm that learns from previously generated decoys and steers the search toward native-like regions. A comparison with Rosetta AbInitio protocol shows that EdaFold is able to generate models with lower energies and to enhance the percentage of near-native coarse-grained decoys on a benchmark of [Image: see text] proteins. The best coarse-grained models produced by both methods were refined into all-atom models and used in molecular replacement. All atom decoys produced out of EdaFold’s decoy set reach high enough accuracy to solve the crystallographic phase problem by molecular replacement for some test proteins. EdaFold showed a higher success rate in molecular replacement when compared to Rosetta. Our study suggests that improving low resolution coarse-grained decoys allows computational methods to avoid subsequent sampling issues during all-atom refinement and to produce better all-atom models. EdaFold can be downloaded from http://www.riken.jp/zhangiru/software/.
format Online
Article
Text
id pubmed-3400640
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-34006402012-07-24 A Probabilistic Fragment-Based Protein Structure Prediction Algorithm Simoncini, David Berenger, Francois Shrestha, Rojan Zhang, Kam Y. J. PLoS One Research Article Conformational sampling is one of the bottlenecks in fragment-based protein structure prediction approaches. They generally start with a coarse-grained optimization where mainchain atoms and centroids of side chains are considered, followed by a fine-grained optimization with an all-atom representation of proteins. It is during this coarse-grained phase that fragment-based methods sample intensely the conformational space. If the native-like region is sampled more, the accuracy of the final all-atom predictions may be improved accordingly. In this work we present EdaFold, a new method for fragment-based protein structure prediction based on an Estimation of Distribution Algorithm. Fragment-based approaches build protein models by assembling short fragments from known protein structures. Whereas the probability mass functions over the fragment libraries are uniform in the usual case, we propose an algorithm that learns from previously generated decoys and steers the search toward native-like regions. A comparison with Rosetta AbInitio protocol shows that EdaFold is able to generate models with lower energies and to enhance the percentage of near-native coarse-grained decoys on a benchmark of [Image: see text] proteins. The best coarse-grained models produced by both methods were refined into all-atom models and used in molecular replacement. All atom decoys produced out of EdaFold’s decoy set reach high enough accuracy to solve the crystallographic phase problem by molecular replacement for some test proteins. EdaFold showed a higher success rate in molecular replacement when compared to Rosetta. Our study suggests that improving low resolution coarse-grained decoys allows computational methods to avoid subsequent sampling issues during all-atom refinement and to produce better all-atom models. EdaFold can be downloaded from http://www.riken.jp/zhangiru/software/. Public Library of Science 2012-07-19 /pmc/articles/PMC3400640/ /pubmed/22829868 http://dx.doi.org/10.1371/journal.pone.0038799 Text en Simoncini et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Simoncini, David
Berenger, Francois
Shrestha, Rojan
Zhang, Kam Y. J.
A Probabilistic Fragment-Based Protein Structure Prediction Algorithm
title A Probabilistic Fragment-Based Protein Structure Prediction Algorithm
title_full A Probabilistic Fragment-Based Protein Structure Prediction Algorithm
title_fullStr A Probabilistic Fragment-Based Protein Structure Prediction Algorithm
title_full_unstemmed A Probabilistic Fragment-Based Protein Structure Prediction Algorithm
title_short A Probabilistic Fragment-Based Protein Structure Prediction Algorithm
title_sort probabilistic fragment-based protein structure prediction algorithm
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3400640/
https://www.ncbi.nlm.nih.gov/pubmed/22829868
http://dx.doi.org/10.1371/journal.pone.0038799
work_keys_str_mv AT simoncinidavid aprobabilisticfragmentbasedproteinstructurepredictionalgorithm
AT berengerfrancois aprobabilisticfragmentbasedproteinstructurepredictionalgorithm
AT shrestharojan aprobabilisticfragmentbasedproteinstructurepredictionalgorithm
AT zhangkamyj aprobabilisticfragmentbasedproteinstructurepredictionalgorithm
AT simoncinidavid probabilisticfragmentbasedproteinstructurepredictionalgorithm
AT berengerfrancois probabilisticfragmentbasedproteinstructurepredictionalgorithm
AT shrestharojan probabilisticfragmentbasedproteinstructurepredictionalgorithm
AT zhangkamyj probabilisticfragmentbasedproteinstructurepredictionalgorithm