The missing link: covalent linkages in structural models

Covalent linkages between constituent blocks of macromolecules and ligands have been subject to inconsistent treatment during the model-building, refinement and deposition process. This may stem from a number of sources, including difficulties with initially detecting the covalent linkage, identifyi...

Bibliographic Details
Main authors: Nicholls, Robert A., Wojdyr, Marcin, Joosten, Robbie P., Catapano, Lucrezia, Long, Fei, Fischer, Marcus, Emsley, Paul, Murshudov, Garib N.
Format: Online Article Text
Language: English
Published: International Union of Crystallography 2021
Subjects: Ccp4
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8171067/
https://www.ncbi.nlm.nih.gov/pubmed/34076588
http://dx.doi.org/10.1107/S2059798321003934
author Nicholls, Robert A.
Wojdyr, Marcin
Joosten, Robbie P.
Catapano, Lucrezia
Long, Fei
Fischer, Marcus
Emsley, Paul
Murshudov, Garib N.
collection PubMed
description Covalent linkages between constituent blocks of macromolecules and ligands have been subject to inconsistent treatment during the model-building, refinement and deposition process. This may stem from a number of sources, including difficulties with initially detecting the covalent linkage, identifying the correct chemistry, obtaining an appropriate restraint dictionary and ensuring its correct application. The analysis presented herein assesses the extent of problems involving covalent linkages in the Protein Data Bank (PDB). Not only will this facilitate the remediation of existing models, but also, more importantly, it will inform and thus improve the quality of future linkages. By considering linkages of known type in the CCP4 Monomer Library (CCP4-ML), failure to model a covalent linkage is identified to result in inaccurate (systematically longer) interatomic distances. Scanning the PDB for proximal atom pairs that do not have a corresponding type in the CCP4-ML reveals a large number of commonly occurring types of unannotated potential linkages; in general, these may or may not be covalently linked. Manual consideration of the most commonly occurring cases identifies a number of genuine classes of covalent linkages. The recent expansion of the CCP4-ML is discussed, which has involved the addition of over 16 000 and the replacement of over 11 000 component dictionaries using AceDRG. As part of this effort, the CCP4-ML has also been extended using AceDRG link dictionaries for the aforementioned linkage types identified in this analysis. This will facilitate the identification of such linkage types in future modelling efforts, whilst concurrently easing the process involved in their application. The need for a universal standard for maintaining link records corresponding to covalent linkages, and references to the associated dictionaries used during modelling and refinement, following deposition to the PDB is emphasized. 
The importance of correctly modelling covalent linkages is demonstrated using a case study, which involves the covalent linkage of an inhibitor to the main protease in various viral species, including SARS-CoV-2. This example demonstrates the importance of properly modelling covalent linkages using a comprehensive restraint dictionary, as opposed to just using a single interatomic distance restraint or failing to model the covalent linkage at all.
format Online
Article
Text
id pubmed-8171067
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher International Union of Crystallography
record_format MEDLINE/PubMed
spelling pubmed-8171067 2021-06-14 The missing link: covalent linkages in structural models Nicholls, Robert A. Wojdyr, Marcin Joosten, Robbie P. Catapano, Lucrezia Long, Fei Fischer, Marcus Emsley, Paul Murshudov, Garib N. Acta Crystallogr D Struct Biol Ccp4 International Union of Crystallography 2021-05-19 /pmc/articles/PMC8171067/ /pubmed/34076588 http://dx.doi.org/10.1107/S2059798321003934 Text en © Nicholls et al. 2021 https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.
title The missing link: covalent linkages in structural models
topic Ccp4
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8171067/
https://www.ncbi.nlm.nih.gov/pubmed/34076588
http://dx.doi.org/10.1107/S2059798321003934