Cargando…
Making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods
Phasing by single-wavelength anomalous diffraction (SAD) from multiple crystallographic data sets can be particularly demanding because of the weak anomalous signal and possible non-isomorphism. The identification and exclusion of non-isomorphous data sets by suitable indicators is therefore indispe...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
International Union of Crystallography
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7336379/ https://www.ncbi.nlm.nih.gov/pubmed/32627737 http://dx.doi.org/10.1107/S2059798320006348 |
_version_ | 1783554306615541760 |
---|---|
author | Assmann, Greta M. Wang, Meitian Diederichs, Kay |
author_facet | Assmann, Greta M. Wang, Meitian Diederichs, Kay |
author_sort | Assmann, Greta M. |
collection | PubMed |
description | Phasing by single-wavelength anomalous diffraction (SAD) from multiple crystallographic data sets can be particularly demanding because of the weak anomalous signal and possible non-isomorphism. The identification and exclusion of non-isomorphous data sets by suitable indicators is therefore indispensable. Here, simple and robust data-selection methods are described. A multi-dimensional scaling procedure is first used to identify data sets with large non-isomorphism relative to clusters of other data sets. Within each cluster that it identifies, further selection is based on the weighted ΔCC(1/2), a quantity representing the influence of a set of reflections on the overall CC(1/2) of the merged data. The anomalous signal is further improved by optimizing the scaling protocol. The success of iterating the selection and scaling steps was verified by substructure determination and subsequent structure solution. Three serial synchrotron crystallography (SSX) SAD test cases with hundreds of partial data sets and one test case with 62 complete data sets were analyzed. Structure solution was dramatically simplified with this procedure, and enabled solution of the structures after a few selection/scaling iterations. To explore the limits, the procedure was tested with much fewer data than originally required and could still solve the structure in several cases. In addition, an SSX data challenge, minimizing the number of (simulated) data sets necessary to solve the structure, was significantly underbid. |
format | Online Article Text |
id | pubmed-7336379 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | International Union of Crystallography |
record_format | MEDLINE/PubMed |
spelling | pubmed-73363792020-07-17 Making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods Assmann, Greta M. Wang, Meitian Diederichs, Kay Acta Crystallogr D Struct Biol Research Papers Phasing by single-wavelength anomalous diffraction (SAD) from multiple crystallographic data sets can be particularly demanding because of the weak anomalous signal and possible non-isomorphism. The identification and exclusion of non-isomorphous data sets by suitable indicators is therefore indispensable. Here, simple and robust data-selection methods are described. A multi-dimensional scaling procedure is first used to identify data sets with large non-isomorphism relative to clusters of other data sets. Within each cluster that it identifies, further selection is based on the weighted ΔCC(1/2), a quantity representing the influence of a set of reflections on the overall CC(1/2) of the merged data. The anomalous signal is further improved by optimizing the scaling protocol. The success of iterating the selection and scaling steps was verified by substructure determination and subsequent structure solution. Three serial synchrotron crystallography (SSX) SAD test cases with hundreds of partial data sets and one test case with 62 complete data sets were analyzed. Structure solution was dramatically simplified with this procedure, and enabled solution of the structures after a few selection/scaling iterations. To explore the limits, the procedure was tested with much fewer data than originally required and could still solve the structure in several cases. In addition, an SSX data challenge, minimizing the number of (simulated) data sets necessary to solve the structure, was significantly underbid. International Union of Crystallography 2020-06-17 /pmc/articles/PMC7336379/ /pubmed/32627737 http://dx.doi.org/10.1107/S2059798320006348 Text en © Assmann et al. 2020 http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Research Papers Assmann, Greta M. Wang, Meitian Diederichs, Kay Making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods |
title | Making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods |
title_full | Making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods |
title_fullStr | Making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods |
title_full_unstemmed | Making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods |
title_short | Making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods |
title_sort | making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods |
topic | Research Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7336379/ https://www.ncbi.nlm.nih.gov/pubmed/32627737 http://dx.doi.org/10.1107/S2059798320006348 |
work_keys_str_mv | AT assmanngretam makingadifferenceinmultidatasetcrystallographysimpleanddeterministicdatascalingselectionmethods AT wangmeitian makingadifferenceinmultidatasetcrystallographysimpleanddeterministicdatascalingselectionmethods AT diederichskay makingadifferenceinmultidatasetcrystallographysimpleanddeterministicdatascalingselectionmethods |