Cargando…

PDB-Explorer: a web-based interactive map of the protein data bank in shape space

BACKGROUND: The RCSB Protein Data Bank (PDB) provides public access to experimentally determined 3D-structures of biological macromolecules (proteins, peptides and nucleic acids). While various tools are available to explore the PDB, options to access the global structural diversity of the entire PD...

Descripción completa

Detalles Bibliográficos
Autores principales: Jin, Xian, Awale, Mahendra, Zasso, Michaël, Kostro, Daniel, Patiny, Luc, Reymond, Jean-Louis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4619230/
https://www.ncbi.nlm.nih.gov/pubmed/26493835
http://dx.doi.org/10.1186/s12859-015-0776-9
_version_ 1782397054403739648
author Jin, Xian
Awale, Mahendra
Zasso, Michaël
Kostro, Daniel
Patiny, Luc
Reymond, Jean-Louis
author_facet Jin, Xian
Awale, Mahendra
Zasso, Michaël
Kostro, Daniel
Patiny, Luc
Reymond, Jean-Louis
author_sort Jin, Xian
collection PubMed
description BACKGROUND: The RCSB Protein Data Bank (PDB) provides public access to experimentally determined 3D-structures of biological macromolecules (proteins, peptides and nucleic acids). While various tools are available to explore the PDB, options to access the global structural diversity of the entire PDB and to perceive relationships between PDB structures remain very limited. METHODS: A 136-dimensional atom pair 3D-fingerprint for proteins (3DP) counting categorized atom pairs at increasing through-space distances was designed to represent the molecular shape of PDB-entries. Nearest neighbor searches examples were reported exemplifying the ability of 3DP-similarity to identify closely related biomolecules from small peptides to enzyme and large multiprotein complexes such as virus particles. The principle component analysis was used to obtain the visualization of PDB in 3DP-space. RESULTS: The 3DP property space groups proteins and protein assemblies according to their 3D-shape similarity, yet shows exquisite ability to distinguish between closely related structures. An interactive website called PDB-Explorer is presented featuring a color-coded interactive map of PDB in 3DP-space. Each pixel of the map contains one or more PDB-entries which are directly visualized as ribbon diagrams when the pixel is selected. The PDB-Explorer website allows performing 3DP-nearest neighbor searches of any PDB-entry or of any structure uploaded as protein-type PDB file. All functionalities on the website are implemented in JavaScript in a platform-independent manner and draw data from a server that is updated daily with the latest PDB additions, ensuring complete and up-to-date coverage. The essentially instantaneous 3DP-similarity search with the PDB-Explorer provides results comparable to those of much slower 3D-alignment algorithms, and automatically clusters proteins from the same superfamilies in tight groups. CONCLUSION: A chemical space classification of PDB based on molecular shape was obtained using a new atom-pair 3D-fingerprint for proteins and implemented in a web-based database exploration tool comprising an interactive color-coded map of the PDB chemical space and a nearest neighbor search tool. The PDB-Explorer website is freely available at www.cheminfo.org/pdbexplorer and represents an unprecedented opportunity to interactively visualize and explore the structural diversity of the PDB. ᅟ: [Figure: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0776-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4619230
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46192302015-10-26 PDB-Explorer: a web-based interactive map of the protein data bank in shape space Jin, Xian Awale, Mahendra Zasso, Michaël Kostro, Daniel Patiny, Luc Reymond, Jean-Louis BMC Bioinformatics Research Article BACKGROUND: The RCSB Protein Data Bank (PDB) provides public access to experimentally determined 3D-structures of biological macromolecules (proteins, peptides and nucleic acids). While various tools are available to explore the PDB, options to access the global structural diversity of the entire PDB and to perceive relationships between PDB structures remain very limited. METHODS: A 136-dimensional atom pair 3D-fingerprint for proteins (3DP) counting categorized atom pairs at increasing through-space distances was designed to represent the molecular shape of PDB-entries. Nearest neighbor searches examples were reported exemplifying the ability of 3DP-similarity to identify closely related biomolecules from small peptides to enzyme and large multiprotein complexes such as virus particles. The principle component analysis was used to obtain the visualization of PDB in 3DP-space. RESULTS: The 3DP property space groups proteins and protein assemblies according to their 3D-shape similarity, yet shows exquisite ability to distinguish between closely related structures. An interactive website called PDB-Explorer is presented featuring a color-coded interactive map of PDB in 3DP-space. Each pixel of the map contains one or more PDB-entries which are directly visualized as ribbon diagrams when the pixel is selected. The PDB-Explorer website allows performing 3DP-nearest neighbor searches of any PDB-entry or of any structure uploaded as protein-type PDB file. All functionalities on the website are implemented in JavaScript in a platform-independent manner and draw data from a server that is updated daily with the latest PDB additions, ensuring complete and up-to-date coverage. The essentially instantaneous 3DP-similarity search with the PDB-Explorer provides results comparable to those of much slower 3D-alignment algorithms, and automatically clusters proteins from the same superfamilies in tight groups. CONCLUSION: A chemical space classification of PDB based on molecular shape was obtained using a new atom-pair 3D-fingerprint for proteins and implemented in a web-based database exploration tool comprising an interactive color-coded map of the PDB chemical space and a nearest neighbor search tool. The PDB-Explorer website is freely available at www.cheminfo.org/pdbexplorer and represents an unprecedented opportunity to interactively visualize and explore the structural diversity of the PDB. ᅟ: [Figure: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0776-9) contains supplementary material, which is available to authorized users. BioMed Central 2015-10-23 /pmc/articles/PMC4619230/ /pubmed/26493835 http://dx.doi.org/10.1186/s12859-015-0776-9 Text en © Jin et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Jin, Xian
Awale, Mahendra
Zasso, Michaël
Kostro, Daniel
Patiny, Luc
Reymond, Jean-Louis
PDB-Explorer: a web-based interactive map of the protein data bank in shape space
title PDB-Explorer: a web-based interactive map of the protein data bank in shape space
title_full PDB-Explorer: a web-based interactive map of the protein data bank in shape space
title_fullStr PDB-Explorer: a web-based interactive map of the protein data bank in shape space
title_full_unstemmed PDB-Explorer: a web-based interactive map of the protein data bank in shape space
title_short PDB-Explorer: a web-based interactive map of the protein data bank in shape space
title_sort pdb-explorer: a web-based interactive map of the protein data bank in shape space
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4619230/
https://www.ncbi.nlm.nih.gov/pubmed/26493835
http://dx.doi.org/10.1186/s12859-015-0776-9
work_keys_str_mv AT jinxian pdbexplorerawebbasedinteractivemapoftheproteindatabankinshapespace
AT awalemahendra pdbexplorerawebbasedinteractivemapoftheproteindatabankinshapespace
AT zassomichael pdbexplorerawebbasedinteractivemapoftheproteindatabankinshapespace
AT kostrodaniel pdbexplorerawebbasedinteractivemapoftheproteindatabankinshapespace
AT patinyluc pdbexplorerawebbasedinteractivemapoftheproteindatabankinshapespace
AT reymondjeanlouis pdbexplorerawebbasedinteractivemapoftheproteindatabankinshapespace