PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank
While the Protein Data Bank (PDB) contains a wealth of structural information on ligands bound to macromolecules, their analysis can be challenging due to the large amount and diversity of data. Here, we present PDBe CCDUtils, a versatile toolkit for processing and analysing small molecules from the...
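Main authors: | Kunnakkattu, Ibrahim Roshan; Choudhary, Preeti; Pravda, Lukas; Nadzirin, Nurul; Smart, Oliver S.; Yuan, Qi; Anyango, Stephen; Nair, Sreenath; Varadi, Mihaly; Velankar, Sameer |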
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Springer International Publishing, 2023 |
Subjects: | Software |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10693035/ https://www.ncbi.nlm.nih.gov/pubmed/38042830 http://dx.doi.org/10.1186/s13321-023-00786-w |
_version_ | 1785153069288259584 |
---|---|
author | Kunnakkattu, Ibrahim Roshan Choudhary, Preeti Pravda, Lukas Nadzirin, Nurul Smart, Oliver S. Yuan, Qi Anyango, Stephen Nair, Sreenath Varadi, Mihaly Velankar, Sameer |
author_facet | Kunnakkattu, Ibrahim Roshan Choudhary, Preeti Pravda, Lukas Nadzirin, Nurul Smart, Oliver S. Yuan, Qi Anyango, Stephen Nair, Sreenath Varadi, Mihaly Velankar, Sameer |
author_sort | Kunnakkattu, Ibrahim Roshan |
collection | PubMed |
description | While the Protein Data Bank (PDB) contains a wealth of structural information on ligands bound to macromolecules, their analysis can be challenging due to the large amount and diversity of data. Here, we present PDBe CCDUtils, a versatile toolkit for processing and analysing small molecules from the PDB in PDBx/mmCIF format. PDBe CCDUtils provides streamlined access to all the metadata for small molecules in the PDB and offers a set of convenient methods to compute various properties using RDKit, such as 2D depictions, 3D conformers, physicochemical properties, scaffolds, common fragments, and cross-references to small molecule databases using UniChem. The toolkit also provides methods for identifying all the covalently attached chemical components in a macromolecular structure and calculating similarity among small molecules. By providing a broad range of functionality, PDBe CCDUtils caters to the needs of researchers in cheminformatics, structural biology, bioinformatics and computational chemistry. GRAPHICAL ABSTRACT: [Image: see text] |
format | Online Article Text |
id | pubmed-10693035 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-106930352023-12-03 PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank Kunnakkattu, Ibrahim Roshan Choudhary, Preeti Pravda, Lukas Nadzirin, Nurul Smart, Oliver S. Yuan, Qi Anyango, Stephen Nair, Sreenath Varadi, Mihaly Velankar, Sameer J Cheminform Software While the Protein Data Bank (PDB) contains a wealth of structural information on ligands bound to macromolecules, their analysis can be challenging due to the large amount and diversity of data. Here, we present PDBe CCDUtils, a versatile toolkit for processing and analysing small molecules from the PDB in PDBx/mmCIF format. PDBe CCDUtils provides streamlined access to all the metadata for small molecules in the PDB and offers a set of convenient methods to compute various properties using RDKit, such as 2D depictions, 3D conformers, physicochemical properties, scaffolds, common fragments, and cross-references to small molecule databases using UniChem. The toolkit also provides methods for identifying all the covalently attached chemical components in a macromolecular structure and calculating similarity among small molecules. By providing a broad range of functionality, PDBe CCDUtils caters to the needs of researchers in cheminformatics, structural biology, bioinformatics and computational chemistry. GRAPHICAL ABSTRACT: [Image: see text] Springer International Publishing 2023-12-02 /pmc/articles/PMC10693035/ /pubmed/38042830 http://dx.doi.org/10.1186/s13321-023-00786-w Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. 
The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Software Kunnakkattu, Ibrahim Roshan Choudhary, Preeti Pravda, Lukas Nadzirin, Nurul Smart, Oliver S. Yuan, Qi Anyango, Stephen Nair, Sreenath Varadi, Mihaly Velankar, Sameer PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank |
title | PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank |
title_full | PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank |
title_fullStr | PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank |
title_full_unstemmed | PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank |
title_short | PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank |
title_sort | pdbe ccdutils: an rdkit-based toolkit for handling and analysing small molecules in the protein data bank |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10693035/ https://www.ncbi.nlm.nih.gov/pubmed/38042830 http://dx.doi.org/10.1186/s13321-023-00786-w |
work_keys_str_mv | AT kunnakkattuibrahimroshan pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank AT choudharypreeti pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank AT pravdalukas pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank AT nadzirinnurul pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank AT smartolivers pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank AT yuanqi pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank AT anyangostephen pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank AT nairsreenath pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank AT varadimihaly pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank AT velankarsameer pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank |