The missing link: covalent linkages in structural models
Covalent linkages between constituent blocks of macromolecules and ligands have been subject to inconsistent treatment during the model-building, refinement and deposition process. This may stem from a number of sources, including difficulties with initially detecting the covalent linkage, identifyi...
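Main authors: | Robert A. Nicholls, Marcin Wojdyr, Robbie P. Joosten, Lucrezia Catapano, Fei Long, Marcus Fischer, Paul Emsley, Garib N. Murshudov |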
---|---|
Format: | Online Article Text |
Language: | English |
Published: |
International Union of Crystallography
2021
|
Subjects: | Ccp4 |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8171067/ https://www.ncbi.nlm.nih.gov/pubmed/34076588 http://dx.doi.org/10.1107/S2059798321003934 |
_version_ | 1783702359293034496 |
---|---|
author | Nicholls, Robert A. Wojdyr, Marcin Joosten, Robbie P. Catapano, Lucrezia Long, Fei Fischer, Marcus Emsley, Paul Murshudov, Garib N. |
author_facet | Nicholls, Robert A. Wojdyr, Marcin Joosten, Robbie P. Catapano, Lucrezia Long, Fei Fischer, Marcus Emsley, Paul Murshudov, Garib N. |
author_sort | Nicholls, Robert A. |
collection | PubMed |
description | Covalent linkages between constituent blocks of macromolecules and ligands have been subject to inconsistent treatment during the model-building, refinement and deposition process. This may stem from a number of sources, including difficulties with initially detecting the covalent linkage, identifying the correct chemistry, obtaining an appropriate restraint dictionary and ensuring its correct application. The analysis presented herein assesses the extent of problems involving covalent linkages in the Protein Data Bank (PDB). Not only will this facilitate the remediation of existing models, but also, more importantly, it will inform and thus improve the quality of future linkages. By considering linkages of known type in the CCP4 Monomer Library (CCP4-ML), failure to model a covalent linkage is identified to result in inaccurate (systematically longer) interatomic distances. Scanning the PDB for proximal atom pairs that do not have a corresponding type in the CCP4-ML reveals a large number of commonly occurring types of unannotated potential linkages; in general, these may or may not be covalently linked. Manual consideration of the most commonly occurring cases identifies a number of genuine classes of covalent linkages. The recent expansion of the CCP4-ML is discussed, which has involved the addition of over 16 000 and the replacement of over 11 000 component dictionaries using AceDRG. As part of this effort, the CCP4-ML has also been extended using AceDRG link dictionaries for the aforementioned linkage types identified in this analysis. This will facilitate the identification of such linkage types in future modelling efforts, whilst concurrently easing the process involved in their application. The need for a universal standard for maintaining link records corresponding to covalent linkages, and references to the associated dictionaries used during modelling and refinement, following deposition to the PDB is emphasized. 
The importance of correctly modelling covalent linkages is demonstrated using a case study, which involves the covalent linkage of an inhibitor to the main protease in various viral species, including SARS-CoV-2. This example demonstrates the importance of properly modelling covalent linkages using a comprehensive restraint dictionary, as opposed to just using a single interatomic distance restraint or failing to model the covalent linkage at all. |
format | Online Article Text |
id | pubmed-8171067 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | International Union of Crystallography |
record_format | MEDLINE/PubMed |
spelling | pubmed-8171067 2021-06-14 The missing link: covalent linkages in structural models Nicholls, Robert A. Wojdyr, Marcin Joosten, Robbie P. Catapano, Lucrezia Long, Fei Fischer, Marcus Emsley, Paul Murshudov, Garib N. Acta Crystallogr D Struct Biol Ccp4 International Union of Crystallography 2021-05-19 /pmc/articles/PMC8171067/ /pubmed/34076588 http://dx.doi.org/10.1107/S2059798321003934 Text en © Nicholls et al. 2021 https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited. |
spellingShingle | Ccp4 Nicholls, Robert A. Wojdyr, Marcin Joosten, Robbie P. Catapano, Lucrezia Long, Fei Fischer, Marcus Emsley, Paul Murshudov, Garib N. The missing link: covalent linkages in structural models |
title | The missing link: covalent linkages in structural models |
title_sort | missing link: covalent linkages in structural models |
topic | Ccp4 |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8171067/ https://www.ncbi.nlm.nih.gov/pubmed/34076588 http://dx.doi.org/10.1107/S2059798321003934 |
work_keys_str_mv | AT nichollsroberta themissinglinkcovalentlinkagesinstructuralmodels AT wojdyrmarcin themissinglinkcovalentlinkagesinstructuralmodels AT joostenrobbiep themissinglinkcovalentlinkagesinstructuralmodels AT catapanolucrezia themissinglinkcovalentlinkagesinstructuralmodels AT longfei themissinglinkcovalentlinkagesinstructuralmodels AT fischermarcus themissinglinkcovalentlinkagesinstructuralmodels AT emsleypaul themissinglinkcovalentlinkagesinstructuralmodels AT murshudovgaribn themissinglinkcovalentlinkagesinstructuralmodels AT nichollsroberta missinglinkcovalentlinkagesinstructuralmodels AT wojdyrmarcin missinglinkcovalentlinkagesinstructuralmodels AT joostenrobbiep missinglinkcovalentlinkagesinstructuralmodels AT catapanolucrezia missinglinkcovalentlinkagesinstructuralmodels AT longfei missinglinkcovalentlinkagesinstructuralmodels AT fischermarcus missinglinkcovalentlinkagesinstructuralmodels AT emsleypaul missinglinkcovalentlinkagesinstructuralmodels AT murshudovgaribn missinglinkcovalentlinkagesinstructuralmodels |