Cargando…

Making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods

Phasing by single-wavelength anomalous diffraction (SAD) from multiple crystallographic data sets can be particularly demanding because of the weak anomalous signal and possible non-isomorphism. The identification and exclusion of non-isomorphous data sets by suitable indicators is therefore indispe...

Descripción completa

Detalles Bibliográficos
Autores principales: Assmann, Greta M., Wang, Meitian, Diederichs, Kay
Formato: Online Artículo Texto
Lenguaje:English
Publicado: International Union of Crystallography 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7336379/
https://www.ncbi.nlm.nih.gov/pubmed/32627737
http://dx.doi.org/10.1107/S2059798320006348
_version_ 1783554306615541760
author Assmann, Greta M.
Wang, Meitian
Diederichs, Kay
author_facet Assmann, Greta M.
Wang, Meitian
Diederichs, Kay
author_sort Assmann, Greta M.
collection PubMed
description Phasing by single-wavelength anomalous diffraction (SAD) from multiple crystallographic data sets can be particularly demanding because of the weak anomalous signal and possible non-isomorphism. The identification and exclusion of non-isomorphous data sets by suitable indicators is therefore indispensable. Here, simple and robust data-selection methods are described. A multi-dimensional scaling procedure is first used to identify data sets with large non-isomorphism relative to clusters of other data sets. Within each cluster that it identifies, further selection is based on the weighted ΔCC(1/2), a quantity representing the influence of a set of reflections on the overall CC(1/2) of the merged data. The anomalous signal is further improved by optimizing the scaling protocol. The success of iterating the selection and scaling steps was verified by substructure determination and subsequent structure solution. Three serial synchrotron crystallography (SSX) SAD test cases with hundreds of partial data sets and one test case with 62 complete data sets were analyzed. Structure solution was dramatically simplified with this procedure, and enabled solution of the structures after a few selection/scaling iterations. To explore the limits, the procedure was tested with much fewer data than originally required and could still solve the structure in several cases. In addition, an SSX data challenge, minimizing the number of (simulated) data sets necessary to solve the structure, was significantly underbid.
format Online
Article
Text
id pubmed-7336379
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher International Union of Crystallography
record_format MEDLINE/PubMed
spelling pubmed-73363792020-07-17 Making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods Assmann, Greta M. Wang, Meitian Diederichs, Kay Acta Crystallogr D Struct Biol Research Papers Phasing by single-wavelength anomalous diffraction (SAD) from multiple crystallographic data sets can be particularly demanding because of the weak anomalous signal and possible non-isomorphism. The identification and exclusion of non-isomorphous data sets by suitable indicators is therefore indispensable. Here, simple and robust data-selection methods are described. A multi-dimensional scaling procedure is first used to identify data sets with large non-isomorphism relative to clusters of other data sets. Within each cluster that it identifies, further selection is based on the weighted ΔCC(1/2), a quantity representing the influence of a set of reflections on the overall CC(1/2) of the merged data. The anomalous signal is further improved by optimizing the scaling protocol. The success of iterating the selection and scaling steps was verified by substructure determination and subsequent structure solution. Three serial synchrotron crystallography (SSX) SAD test cases with hundreds of partial data sets and one test case with 62 complete data sets were analyzed. Structure solution was dramatically simplified with this procedure, and enabled solution of the structures after a few selection/scaling iterations. To explore the limits, the procedure was tested with much fewer data than originally required and could still solve the structure in several cases. In addition, an SSX data challenge, minimizing the number of (simulated) data sets necessary to solve the structure, was significantly underbid. International Union of Crystallography 2020-06-17 /pmc/articles/PMC7336379/ /pubmed/32627737 http://dx.doi.org/10.1107/S2059798320006348 Text en © Assmann et al. 2020 http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.http://creativecommons.org/licenses/by/4.0/
spellingShingle Research Papers
Assmann, Greta M.
Wang, Meitian
Diederichs, Kay
Making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods
title Making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods
title_full Making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods
title_fullStr Making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods
title_full_unstemmed Making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods
title_short Making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods
title_sort making a difference in multi-data-set crystallography: simple and deterministic data-scaling/selection methods
topic Research Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7336379/
https://www.ncbi.nlm.nih.gov/pubmed/32627737
http://dx.doi.org/10.1107/S2059798320006348
work_keys_str_mv AT assmanngretam makingadifferenceinmultidatasetcrystallographysimpleanddeterministicdatascalingselectionmethods
AT wangmeitian makingadifferenceinmultidatasetcrystallographysimpleanddeterministicdatascalingselectionmethods
AT diederichskay makingadifferenceinmultidatasetcrystallographysimpleanddeterministicdatascalingselectionmethods