Cargando…
The JCSG MR pipeline: optimized alignments, multiple models and parallel searches
The success rate of molecular replacement (MR) falls considerably when search models share less than 35% sequence identity with their templates, but can be improved significantly by using fold-recognition methods combined with exhaustive MR searches. Models based on alignments calculated with fold-r...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
International Union of Crystallography
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2394805/ https://www.ncbi.nlm.nih.gov/pubmed/18094477 http://dx.doi.org/10.1107/S0907444907050111 |
_version_ | 1782155454184423424 |
---|---|
author | Schwarzenbacher, Robert Godzik, Adam Jaroszewski, Lukasz |
author_facet | Schwarzenbacher, Robert Godzik, Adam Jaroszewski, Lukasz |
author_sort | Schwarzenbacher, Robert |
collection | PubMed |
description | The success rate of molecular replacement (MR) falls considerably when search models share less than 35% sequence identity with their templates, but can be improved significantly by using fold-recognition methods combined with exhaustive MR searches. Models based on alignments calculated with fold-recognition algorithms are more accurate than models based on conventional alignment methods such as FASTA or BLAST, which are still widely used for MR. In addition, by designing MR pipelines that integrate phasing and automated refinement and allow parallel processing of such calculations, one can effectively increase the success rate of MR. Here, updated results from the JCSG MR pipeline are presented, which to date has solved 33 MR structures with less than 35% sequence identity to the closest homologue of known structure. By using difficult MR problems as examples, it is demonstrated that successful MR phasing is possible even in cases where the similarity between the model and the template can only be detected with fold-recognition algorithms. In the first step, several search models are built based on all homologues found in the PDB by fold-recognition algorithms. The models resulting from this process are used in parallel MR searches with different combinations of input parameters of the MR phasing algorithm. The putative solutions are subjected to rigid-body and restrained crystallographic refinement and ranked based on the final values of free R factor, figure of merit and deviations from ideal geometry. Finally, crystal packing and electron-density maps are checked to identify the correct solution. If this procedure does not yield a solution with interpretable electron-density maps, then even more alternative models are prepared. The structurally variable regions of a protein family are identified based on alignments of sequences and known structures from that family and appropriate trimmings of the models are proposed. All combinations of these trimmings are applied to the search models and the resulting set of models is used in the MR pipeline. It is estimated that with the improvements in model building and exhaustive parallel searches with existing phasing algorithms, MR can be successful for more than 50% of recognizable homologues of known structures below the threshold of 35% sequence identity. This implies that about one-third of the proteins in a typical bacterial proteome are potential MR targets. |
format | Text |
id | pubmed-2394805 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | International Union of Crystallography |
record_format | MEDLINE/PubMed |
spelling | pubmed-23948052009-03-05 The JCSG MR pipeline: optimized alignments, multiple models and parallel searches Schwarzenbacher, Robert Godzik, Adam Jaroszewski, Lukasz Acta Crystallogr D Biol Crystallogr Research Papers The success rate of molecular replacement (MR) falls considerably when search models share less than 35% sequence identity with their templates, but can be improved significantly by using fold-recognition methods combined with exhaustive MR searches. Models based on alignments calculated with fold-recognition algorithms are more accurate than models based on conventional alignment methods such as FASTA or BLAST, which are still widely used for MR. In addition, by designing MR pipelines that integrate phasing and automated refinement and allow parallel processing of such calculations, one can effectively increase the success rate of MR. Here, updated results from the JCSG MR pipeline are presented, which to date has solved 33 MR structures with less than 35% sequence identity to the closest homologue of known structure. By using difficult MR problems as examples, it is demonstrated that successful MR phasing is possible even in cases where the similarity between the model and the template can only be detected with fold-recognition algorithms. In the first step, several search models are built based on all homologues found in the PDB by fold-recognition algorithms. The models resulting from this process are used in parallel MR searches with different combinations of input parameters of the MR phasing algorithm. The putative solutions are subjected to rigid-body and restrained crystallographic refinement and ranked based on the final values of free R factor, figure of merit and deviations from ideal geometry. Finally, crystal packing and electron-density maps are checked to identify the correct solution. If this procedure does not yield a solution with interpretable electron-density maps, then even more alternative models are prepared. The structurally variable regions of a protein family are identified based on alignments of sequences and known structures from that family and appropriate trimmings of the models are proposed. All combinations of these trimmings are applied to the search models and the resulting set of models is used in the MR pipeline. It is estimated that with the improvements in model building and exhaustive parallel searches with existing phasing algorithms, MR can be successful for more than 50% of recognizable homologues of known structures below the threshold of 35% sequence identity. This implies that about one-third of the proteins in a typical bacterial proteome are potential MR targets. International Union of Crystallography 2008-01-01 2007-12-04 /pmc/articles/PMC2394805/ /pubmed/18094477 http://dx.doi.org/10.1107/S0907444907050111 Text en © International Union of Crystallography 2008 http://journals.iucr.org/services/termsofuse.html This is an open-access article distributed under the terms described at http://journals.iucr.org/services/termsofuse.html. |
spellingShingle | Research Papers Schwarzenbacher, Robert Godzik, Adam Jaroszewski, Lukasz The JCSG MR pipeline: optimized alignments, multiple models and parallel searches |
title | The JCSG MR pipeline: optimized alignments, multiple models and parallel searches |
title_full | The JCSG MR pipeline: optimized alignments, multiple models and parallel searches |
title_fullStr | The JCSG MR pipeline: optimized alignments, multiple models and parallel searches |
title_full_unstemmed | The JCSG MR pipeline: optimized alignments, multiple models and parallel searches |
title_short | The JCSG MR pipeline: optimized alignments, multiple models and parallel searches |
title_sort | jcsg mr pipeline: optimized alignments, multiple models and parallel searches |
topic | Research Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2394805/ https://www.ncbi.nlm.nih.gov/pubmed/18094477 http://dx.doi.org/10.1107/S0907444907050111 |
work_keys_str_mv | AT schwarzenbacherrobert thejcsgmrpipelineoptimizedalignmentsmultiplemodelsandparallelsearches AT godzikadam thejcsgmrpipelineoptimizedalignmentsmultiplemodelsandparallelsearches AT jaroszewskilukasz thejcsgmrpipelineoptimizedalignmentsmultiplemodelsandparallelsearches AT schwarzenbacherrobert jcsgmrpipelineoptimizedalignmentsmultiplemodelsandparallelsearches AT godzikadam jcsgmrpipelineoptimizedalignmentsmultiplemodelsandparallelsearches AT jaroszewskilukasz jcsgmrpipelineoptimizedalignmentsmultiplemodelsandparallelsearches |