Cargando…

Crystal polymorphism in fragment-based lead discovery of ligands of the catalytic domain of UGGT, the glycoprotein folding quality control checkpoint

None of the current data processing pipelines for X-ray crystallography fragment-based lead discovery (FBLD) consults all the information available when deciding on the lattice and symmetry (i.e., the polymorph) of each soaked crystal. Often, X-ray crystallography FBLD pipelines either choose the po...

Descripción completa

Detalles Bibliográficos
Autores principales: Caputo, Alessandro T., Ibba, Roberta, Le Cornu, James D., Darlot, Benoit, Hensen, Mario, Lipp, Colette B., Marcianò, Gabriele, Vasiljević, Snežana, Zitzmann, Nicole, Roversi, Pietro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9794592/
https://www.ncbi.nlm.nih.gov/pubmed/36589243
http://dx.doi.org/10.3389/fmolb.2022.960248
_version_ 1784860071664025600
author Caputo, Alessandro T.
Ibba, Roberta
Le Cornu, James D.
Darlot, Benoit
Hensen, Mario
Lipp, Colette B.
Marcianò, Gabriele
Vasiljević, Snežana
Zitzmann, Nicole
Roversi, Pietro
author_facet Caputo, Alessandro T.
Ibba, Roberta
Le Cornu, James D.
Darlot, Benoit
Hensen, Mario
Lipp, Colette B.
Marcianò, Gabriele
Vasiljević, Snežana
Zitzmann, Nicole
Roversi, Pietro
author_sort Caputo, Alessandro T.
collection PubMed
description None of the current data processing pipelines for X-ray crystallography fragment-based lead discovery (FBLD) consults all the information available when deciding on the lattice and symmetry (i.e., the polymorph) of each soaked crystal. Often, X-ray crystallography FBLD pipelines either choose the polymorph based on cell volume and point-group symmetry of the X-ray diffraction data or leave polymorph attribution to manual intervention on the part of the user. Thus, when the FBLD crystals belong to more than one crystal polymorph, the discovery pipeline can be plagued by space group ambiguity, especially if the polymorphs at hand are variations of the same lattice and, therefore, difficult to tell apart from their morphology and/or their apparent crystal lattices and point groups. In the course of a fragment-based lead discovery effort aimed at finding ligands of the catalytic domain of UDP–glucose glycoprotein glucosyltransferase (UGGT), we encountered a mixture of trigonal crystals and pseudotrigonal triclinic crystals—with the two lattices closely related. In order to resolve that polymorphism ambiguity, we have written and described here a series of Unix shell scripts called CoALLA (crystal polymorph and ligand likelihood-based assignment). The CoALLA scripts are written in Unix shell and use autoPROC for data processing, CCP4-Dimple/REFMAC5 and BUSTER for refinement, and RHOFIT for ligand docking. The choice of the polymorph is effected by carrying out (in each of the known polymorphs) the tasks of diffraction data indexing, integration, scaling, and structural refinement. The most likely polymorph is then chosen as the one with the best structure refinement R(free) statistic. The CoALLA scripts further implement a likelihood-based ligand assignment strategy, starting with macromolecular refinement and automated water addition, followed by removal of the water molecules that appear to be fitting ligand density, and a final round of refinement after random perturbation of the refined macromolecular model, in order to obtain unbiased difference density maps for automated ligand placement. We illustrate the use of CoALLA to discriminate between H3 and P1 crystals used for an FBLD effort to find fragments binding to the catalytic domain of Chaetomium thermophilum UGGT.
format Online
Article
Text
id pubmed-9794592
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-97945922022-12-29 Crystal polymorphism in fragment-based lead discovery of ligands of the catalytic domain of UGGT, the glycoprotein folding quality control checkpoint Caputo, Alessandro T. Ibba, Roberta Le Cornu, James D. Darlot, Benoit Hensen, Mario Lipp, Colette B. Marcianò, Gabriele Vasiljević, Snežana Zitzmann, Nicole Roversi, Pietro Front Mol Biosci Molecular Biosciences None of the current data processing pipelines for X-ray crystallography fragment-based lead discovery (FBLD) consults all the information available when deciding on the lattice and symmetry (i.e., the polymorph) of each soaked crystal. Often, X-ray crystallography FBLD pipelines either choose the polymorph based on cell volume and point-group symmetry of the X-ray diffraction data or leave polymorph attribution to manual intervention on the part of the user. Thus, when the FBLD crystals belong to more than one crystal polymorph, the discovery pipeline can be plagued by space group ambiguity, especially if the polymorphs at hand are variations of the same lattice and, therefore, difficult to tell apart from their morphology and/or their apparent crystal lattices and point groups. In the course of a fragment-based lead discovery effort aimed at finding ligands of the catalytic domain of UDP–glucose glycoprotein glucosyltransferase (UGGT), we encountered a mixture of trigonal crystals and pseudotrigonal triclinic crystals—with the two lattices closely related. In order to resolve that polymorphism ambiguity, we have written and described here a series of Unix shell scripts called CoALLA (crystal polymorph and ligand likelihood-based assignment). The CoALLA scripts are written in Unix shell and use autoPROC for data processing, CCP4-Dimple/REFMAC5 and BUSTER for refinement, and RHOFIT for ligand docking. The choice of the polymorph is effected by carrying out (in each of the known polymorphs) the tasks of diffraction data indexing, integration, scaling, and structural refinement. The most likely polymorph is then chosen as the one with the best structure refinement R(free) statistic. The CoALLA scripts further implement a likelihood-based ligand assignment strategy, starting with macromolecular refinement and automated water addition, followed by removal of the water molecules that appear to be fitting ligand density, and a final round of refinement after random perturbation of the refined macromolecular model, in order to obtain unbiased difference density maps for automated ligand placement. We illustrate the use of CoALLA to discriminate between H3 and P1 crystals used for an FBLD effort to find fragments binding to the catalytic domain of Chaetomium thermophilum UGGT. Frontiers Media S.A. 2022-12-14 /pmc/articles/PMC9794592/ /pubmed/36589243 http://dx.doi.org/10.3389/fmolb.2022.960248 Text en Copyright © 2022 Caputo, Ibba, Le Cornu, Darlot, Hensen, Lipp, Marcianò, Vasiljević, Zitzmann and Roversi. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Molecular Biosciences
Caputo, Alessandro T.
Ibba, Roberta
Le Cornu, James D.
Darlot, Benoit
Hensen, Mario
Lipp, Colette B.
Marcianò, Gabriele
Vasiljević, Snežana
Zitzmann, Nicole
Roversi, Pietro
Crystal polymorphism in fragment-based lead discovery of ligands of the catalytic domain of UGGT, the glycoprotein folding quality control checkpoint
title Crystal polymorphism in fragment-based lead discovery of ligands of the catalytic domain of UGGT, the glycoprotein folding quality control checkpoint
title_full Crystal polymorphism in fragment-based lead discovery of ligands of the catalytic domain of UGGT, the glycoprotein folding quality control checkpoint
title_fullStr Crystal polymorphism in fragment-based lead discovery of ligands of the catalytic domain of UGGT, the glycoprotein folding quality control checkpoint
title_full_unstemmed Crystal polymorphism in fragment-based lead discovery of ligands of the catalytic domain of UGGT, the glycoprotein folding quality control checkpoint
title_short Crystal polymorphism in fragment-based lead discovery of ligands of the catalytic domain of UGGT, the glycoprotein folding quality control checkpoint
title_sort crystal polymorphism in fragment-based lead discovery of ligands of the catalytic domain of uggt, the glycoprotein folding quality control checkpoint
topic Molecular Biosciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9794592/
https://www.ncbi.nlm.nih.gov/pubmed/36589243
http://dx.doi.org/10.3389/fmolb.2022.960248
work_keys_str_mv AT caputoalessandrot crystalpolymorphisminfragmentbasedleaddiscoveryofligandsofthecatalyticdomainofuggttheglycoproteinfoldingqualitycontrolcheckpoint
AT ibbaroberta crystalpolymorphisminfragmentbasedleaddiscoveryofligandsofthecatalyticdomainofuggttheglycoproteinfoldingqualitycontrolcheckpoint
AT lecornujamesd crystalpolymorphisminfragmentbasedleaddiscoveryofligandsofthecatalyticdomainofuggttheglycoproteinfoldingqualitycontrolcheckpoint
AT darlotbenoit crystalpolymorphisminfragmentbasedleaddiscoveryofligandsofthecatalyticdomainofuggttheglycoproteinfoldingqualitycontrolcheckpoint
AT hensenmario crystalpolymorphisminfragmentbasedleaddiscoveryofligandsofthecatalyticdomainofuggttheglycoproteinfoldingqualitycontrolcheckpoint
AT lippcoletteb crystalpolymorphisminfragmentbasedleaddiscoveryofligandsofthecatalyticdomainofuggttheglycoproteinfoldingqualitycontrolcheckpoint
AT marcianogabriele crystalpolymorphisminfragmentbasedleaddiscoveryofligandsofthecatalyticdomainofuggttheglycoproteinfoldingqualitycontrolcheckpoint
AT vasiljevicsnezana crystalpolymorphisminfragmentbasedleaddiscoveryofligandsofthecatalyticdomainofuggttheglycoproteinfoldingqualitycontrolcheckpoint
AT zitzmannnicole crystalpolymorphisminfragmentbasedleaddiscoveryofligandsofthecatalyticdomainofuggttheglycoproteinfoldingqualitycontrolcheckpoint
AT roversipietro crystalpolymorphisminfragmentbasedleaddiscoveryofligandsofthecatalyticdomainofuggttheglycoproteinfoldingqualitycontrolcheckpoint