Cargando…

RCSB Protein Data Bank: Architectural Advances Towards Integrated Searching and Efficient Access to Macromolecular Structure Data from the PDB Archive

The US Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) serves many millions of unique users worldwide by delivering experimentally-determined 3D structures of biomolecules integrated with >40 external data resources via RCSB.org, application programming interface...

Descripción completa

Detalles Bibliográficos
Autores principales: Rose, Yana, Duarte, Jose M., Lowe, Robert, Segura, Joan, Bi, Chunxiao, Bhikadiya, Charmi, Chen, Li, Rose, Alexander S., Bittrich, Sebastian, Burley, Stephen K., Westbrook, John D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9093041/
https://www.ncbi.nlm.nih.gov/pubmed/33186584
http://dx.doi.org/10.1016/j.jmb.2020.11.003
_version_ 1784705252966006784
author Rose, Yana
Duarte, Jose M.
Lowe, Robert
Segura, Joan
Bi, Chunxiao
Bhikadiya, Charmi
Chen, Li
Rose, Alexander S.
Bittrich, Sebastian
Burley, Stephen K.
Westbrook, John D.
author_facet Rose, Yana
Duarte, Jose M.
Lowe, Robert
Segura, Joan
Bi, Chunxiao
Bhikadiya, Charmi
Chen, Li
Rose, Alexander S.
Bittrich, Sebastian
Burley, Stephen K.
Westbrook, John D.
author_sort Rose, Yana
collection PubMed
description The US Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) serves many millions of unique users worldwide by delivering experimentally-determined 3D structures of biomolecules integrated with >40 external data resources via RCSB.org, application programming interfaces (APIs), and FTP downloads. Herein, we present the architectural redesign of RCSB PDB data delivery services that build on existing PDBx/mmCIF data schemas. New data access APIs (data.rcsb.org) enable efficient delivery of all PDB archive data. A novel GraphQL-based API provides flexible, declarative data retrieval along with a simple-to-use REST API. A powerful new search system (search.rcsb.org) seamlessly integrates heterogeneous types of searches across the PDB archive. Searches may combine text attributes, protein or nucleic acid sequences, small-molecule chemical descriptors, 3D macromolecular shapes, and sequence motifs. The new RCSB.org architecture adheres to the FAIR Principles, empowering users to address a wide array of research problems in fundamental biology, biomedicine, biotechnology, bioengineering, and bioenergy.
format Online
Article
Text
id pubmed-9093041
institution National Center for Biotechnology Information
language English
publishDate 2021
record_format MEDLINE/PubMed
spelling pubmed-90930412022-05-11 RCSB Protein Data Bank: Architectural Advances Towards Integrated Searching and Efficient Access to Macromolecular Structure Data from the PDB Archive Rose, Yana Duarte, Jose M. Lowe, Robert Segura, Joan Bi, Chunxiao Bhikadiya, Charmi Chen, Li Rose, Alexander S. Bittrich, Sebastian Burley, Stephen K. Westbrook, John D. J Mol Biol Article The US Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) serves many millions of unique users worldwide by delivering experimentally-determined 3D structures of biomolecules integrated with >40 external data resources via RCSB.org, application programming interfaces (APIs), and FTP downloads. Herein, we present the architectural redesign of RCSB PDB data delivery services that build on existing PDBx/mmCIF data schemas. New data access APIs (data.rcsb.org) enable efficient delivery of all PDB archive data. A novel GraphQL-based API provides flexible, declarative data retrieval along with a simple-to-use REST API. A powerful new search system (search.rcsb.org) seamlessly integrates heterogeneous types of searches across the PDB archive. Searches may combine text attributes, protein or nucleic acid sequences, small-molecule chemical descriptors, 3D macromolecular shapes, and sequence motifs. The new RCSB.org architecture adheres to the FAIR Principles, empowering users to address a wide array of research problems in fundamental biology, biomedicine, biotechnology, bioengineering, and bioenergy. 2021-05-28 2020-11-10 /pmc/articles/PMC9093041/ /pubmed/33186584 http://dx.doi.org/10.1016/j.jmb.2020.11.003 Text en https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ).
spellingShingle Article
Rose, Yana
Duarte, Jose M.
Lowe, Robert
Segura, Joan
Bi, Chunxiao
Bhikadiya, Charmi
Chen, Li
Rose, Alexander S.
Bittrich, Sebastian
Burley, Stephen K.
Westbrook, John D.
RCSB Protein Data Bank: Architectural Advances Towards Integrated Searching and Efficient Access to Macromolecular Structure Data from the PDB Archive
title RCSB Protein Data Bank: Architectural Advances Towards Integrated Searching and Efficient Access to Macromolecular Structure Data from the PDB Archive
title_full RCSB Protein Data Bank: Architectural Advances Towards Integrated Searching and Efficient Access to Macromolecular Structure Data from the PDB Archive
title_fullStr RCSB Protein Data Bank: Architectural Advances Towards Integrated Searching and Efficient Access to Macromolecular Structure Data from the PDB Archive
title_full_unstemmed RCSB Protein Data Bank: Architectural Advances Towards Integrated Searching and Efficient Access to Macromolecular Structure Data from the PDB Archive
title_short RCSB Protein Data Bank: Architectural Advances Towards Integrated Searching and Efficient Access to Macromolecular Structure Data from the PDB Archive
title_sort rcsb protein data bank: architectural advances towards integrated searching and efficient access to macromolecular structure data from the pdb archive
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9093041/
https://www.ncbi.nlm.nih.gov/pubmed/33186584
http://dx.doi.org/10.1016/j.jmb.2020.11.003
work_keys_str_mv AT roseyana rcsbproteindatabankarchitecturaladvancestowardsintegratedsearchingandefficientaccesstomacromolecularstructuredatafromthepdbarchive
AT duartejosem rcsbproteindatabankarchitecturaladvancestowardsintegratedsearchingandefficientaccesstomacromolecularstructuredatafromthepdbarchive
AT lowerobert rcsbproteindatabankarchitecturaladvancestowardsintegratedsearchingandefficientaccesstomacromolecularstructuredatafromthepdbarchive
AT segurajoan rcsbproteindatabankarchitecturaladvancestowardsintegratedsearchingandefficientaccesstomacromolecularstructuredatafromthepdbarchive
AT bichunxiao rcsbproteindatabankarchitecturaladvancestowardsintegratedsearchingandefficientaccesstomacromolecularstructuredatafromthepdbarchive
AT bhikadiyacharmi rcsbproteindatabankarchitecturaladvancestowardsintegratedsearchingandefficientaccesstomacromolecularstructuredatafromthepdbarchive
AT chenli rcsbproteindatabankarchitecturaladvancestowardsintegratedsearchingandefficientaccesstomacromolecularstructuredatafromthepdbarchive
AT rosealexanders rcsbproteindatabankarchitecturaladvancestowardsintegratedsearchingandefficientaccesstomacromolecularstructuredatafromthepdbarchive
AT bittrichsebastian rcsbproteindatabankarchitecturaladvancestowardsintegratedsearchingandefficientaccesstomacromolecularstructuredatafromthepdbarchive
AT burleystephenk rcsbproteindatabankarchitecturaladvancestowardsintegratedsearchingandefficientaccesstomacromolecularstructuredatafromthepdbarchive
AT westbrookjohnd rcsbproteindatabankarchitecturaladvancestowardsintegratedsearchingandefficientaccesstomacromolecularstructuredatafromthepdbarchive