Cargando…

PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank

While the Protein Data Bank (PDB) contains a wealth of structural information on ligands bound to macromolecules, their analysis can be challenging due to the large amount and diversity of data. Here, we present PDBe CCDUtils, a versatile toolkit for processing and analysing small molecules from the...

Descripción completa

Detalles Bibliográficos
Autores principales: Kunnakkattu, Ibrahim Roshan, Choudhary, Preeti, Pravda, Lukas, Nadzirin, Nurul, Smart, Oliver S., Yuan, Qi, Anyango, Stephen, Nair, Sreenath, Varadi, Mihaly, Velankar, Sameer
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10693035/
https://www.ncbi.nlm.nih.gov/pubmed/38042830
http://dx.doi.org/10.1186/s13321-023-00786-w
_version_ 1785153069288259584
author Kunnakkattu, Ibrahim Roshan
Choudhary, Preeti
Pravda, Lukas
Nadzirin, Nurul
Smart, Oliver S.
Yuan, Qi
Anyango, Stephen
Nair, Sreenath
Varadi, Mihaly
Velankar, Sameer
author_facet Kunnakkattu, Ibrahim Roshan
Choudhary, Preeti
Pravda, Lukas
Nadzirin, Nurul
Smart, Oliver S.
Yuan, Qi
Anyango, Stephen
Nair, Sreenath
Varadi, Mihaly
Velankar, Sameer
author_sort Kunnakkattu, Ibrahim Roshan
collection PubMed
description While the Protein Data Bank (PDB) contains a wealth of structural information on ligands bound to macromolecules, their analysis can be challenging due to the large amount and diversity of data. Here, we present PDBe CCDUtils, a versatile toolkit for processing and analysing small molecules from the PDB in PDBx/mmCIF format. PDBe CCDUtils provides streamlined access to all the metadata for small molecules in the PDB and offers a set of convenient methods to compute various properties using RDKit, such as 2D depictions, 3D conformers, physicochemical properties, scaffolds, common fragments, and cross-references to small molecule databases using UniChem. The toolkit also provides methods for identifying all the covalently attached chemical components in a macromolecular structure and calculating similarity among small molecules. By providing a broad range of functionality, PDBe CCDUtils caters to the needs of researchers in cheminformatics, structural biology, bioinformatics and computational chemistry. GRAPHICAL ABSTRACT: [Image: see text]
format Online
Article
Text
id pubmed-10693035
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-106930352023-12-03 PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank Kunnakkattu, Ibrahim Roshan Choudhary, Preeti Pravda, Lukas Nadzirin, Nurul Smart, Oliver S. Yuan, Qi Anyango, Stephen Nair, Sreenath Varadi, Mihaly Velankar, Sameer J Cheminform Software While the Protein Data Bank (PDB) contains a wealth of structural information on ligands bound to macromolecules, their analysis can be challenging due to the large amount and diversity of data. Here, we present PDBe CCDUtils, a versatile toolkit for processing and analysing small molecules from the PDB in PDBx/mmCIF format. PDBe CCDUtils provides streamlined access to all the metadata for small molecules in the PDB and offers a set of convenient methods to compute various properties using RDKit, such as 2D depictions, 3D conformers, physicochemical properties, scaffolds, common fragments, and cross-references to small molecule databases using UniChem. The toolkit also provides methods for identifying all the covalently attached chemical components in a macromolecular structure and calculating similarity among small molecules. By providing a broad range of functionality, PDBe CCDUtils caters to the needs of researchers in cheminformatics, structural biology, bioinformatics and computational chemistry. GRAPHICAL ABSTRACT: [Image: see text] Springer International Publishing 2023-12-02 /pmc/articles/PMC10693035/ /pubmed/38042830 http://dx.doi.org/10.1186/s13321-023-00786-w Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Kunnakkattu, Ibrahim Roshan
Choudhary, Preeti
Pravda, Lukas
Nadzirin, Nurul
Smart, Oliver S.
Yuan, Qi
Anyango, Stephen
Nair, Sreenath
Varadi, Mihaly
Velankar, Sameer
PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank
title PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank
title_full PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank
title_fullStr PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank
title_full_unstemmed PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank
title_short PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank
title_sort pdbe ccdutils: an rdkit-based toolkit for handling and analysing small molecules in the protein data bank
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10693035/
https://www.ncbi.nlm.nih.gov/pubmed/38042830
http://dx.doi.org/10.1186/s13321-023-00786-w
work_keys_str_mv AT kunnakkattuibrahimroshan pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank
AT choudharypreeti pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank
AT pravdalukas pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank
AT nadzirinnurul pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank
AT smartolivers pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank
AT yuanqi pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank
AT anyangostephen pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank
AT nairsreenath pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank
AT varadimihaly pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank
AT velankarsameer pdbeccdutilsanrdkitbasedtoolkitforhandlingandanalysingsmallmoleculesintheproteindatabank