Cargando…

Annotating Macromolecular Complexes in the Protein Data Bank: Improving the FAIRness of Structure Data

Macromolecular complexes are essential functional units in nearly all cellular processes, and their atomic-level understanding is critical for elucidating and modulating molecular mechanisms. The Protein Data Bank (PDB) serves as the global repository for experimentally determined structures of macr...

Descripción completa

Detalles Bibliográficos
Autores principales: Appasamy, Sri Devan, Berrisford, John, Gaborova, Romana, Nair, Sreenath, Anyango, Stephen, Grudinin, Sergei, Deshpande, Mandar, Armstrong, David, Pidruchna, Ivanna, Ellaway, Joseph I. J., Leines, Grisell Díaz, Gupta, Deepti, Harrus, Deborah, Varadi, Mihaly, Velankar, Sameer
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10692154/
https://www.ncbi.nlm.nih.gov/pubmed/38040737
http://dx.doi.org/10.1038/s41597-023-02778-9
_version_ 1785152881192599552
author Appasamy, Sri Devan
Berrisford, John
Gaborova, Romana
Nair, Sreenath
Anyango, Stephen
Grudinin, Sergei
Deshpande, Mandar
Armstrong, David
Pidruchna, Ivanna
Ellaway, Joseph I. J.
Leines, Grisell Díaz
Gupta, Deepti
Harrus, Deborah
Varadi, Mihaly
Velankar, Sameer
author_facet Appasamy, Sri Devan
Berrisford, John
Gaborova, Romana
Nair, Sreenath
Anyango, Stephen
Grudinin, Sergei
Deshpande, Mandar
Armstrong, David
Pidruchna, Ivanna
Ellaway, Joseph I. J.
Leines, Grisell Díaz
Gupta, Deepti
Harrus, Deborah
Varadi, Mihaly
Velankar, Sameer
author_sort Appasamy, Sri Devan
collection PubMed
description Macromolecular complexes are essential functional units in nearly all cellular processes, and their atomic-level understanding is critical for elucidating and modulating molecular mechanisms. The Protein Data Bank (PDB) serves as the global repository for experimentally determined structures of macromolecules. Structural data in the PDB offer valuable insights into the dynamics, conformation, and functional states of biological assemblies. However, the current annotation practices lack standardised naming conventions for assemblies in the PDB, complicating the identification of instances representing the same assembly. In this study, we introduce a method leveraging resources external to PDB, such as the Complex Portal, UniProt and Gene Ontology, to describe assemblies and contextualise them within their biological settings accurately. Employing the proposed approach, we assigned standard names to over 90% of unique assemblies in the PDB and provided persistent identifiers for each assembly. This standardisation of assembly data enhances the PDB, facilitating a deeper understanding of macromolecular complexes. Furthermore, the data standardisation improves the PDB’s FAIR attributes, fostering more effective basic and translational research and scientific education.
format Online
Article
Text
id pubmed-10692154
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-106921542023-12-03 Annotating Macromolecular Complexes in the Protein Data Bank: Improving the FAIRness of Structure Data Appasamy, Sri Devan Berrisford, John Gaborova, Romana Nair, Sreenath Anyango, Stephen Grudinin, Sergei Deshpande, Mandar Armstrong, David Pidruchna, Ivanna Ellaway, Joseph I. J. Leines, Grisell Díaz Gupta, Deepti Harrus, Deborah Varadi, Mihaly Velankar, Sameer Sci Data Article Macromolecular complexes are essential functional units in nearly all cellular processes, and their atomic-level understanding is critical for elucidating and modulating molecular mechanisms. The Protein Data Bank (PDB) serves as the global repository for experimentally determined structures of macromolecules. Structural data in the PDB offer valuable insights into the dynamics, conformation, and functional states of biological assemblies. However, the current annotation practices lack standardised naming conventions for assemblies in the PDB, complicating the identification of instances representing the same assembly. In this study, we introduce a method leveraging resources external to PDB, such as the Complex Portal, UniProt and Gene Ontology, to describe assemblies and contextualise them within their biological settings accurately. Employing the proposed approach, we assigned standard names to over 90% of unique assemblies in the PDB and provided persistent identifiers for each assembly. This standardisation of assembly data enhances the PDB, facilitating a deeper understanding of macromolecular complexes. Furthermore, the data standardisation improves the PDB’s FAIR attributes, fostering more effective basic and translational research and scientific education. Nature Publishing Group UK 2023-12-01 /pmc/articles/PMC10692154/ /pubmed/38040737 http://dx.doi.org/10.1038/s41597-023-02778-9 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Appasamy, Sri Devan
Berrisford, John
Gaborova, Romana
Nair, Sreenath
Anyango, Stephen
Grudinin, Sergei
Deshpande, Mandar
Armstrong, David
Pidruchna, Ivanna
Ellaway, Joseph I. J.
Leines, Grisell Díaz
Gupta, Deepti
Harrus, Deborah
Varadi, Mihaly
Velankar, Sameer
Annotating Macromolecular Complexes in the Protein Data Bank: Improving the FAIRness of Structure Data
title Annotating Macromolecular Complexes in the Protein Data Bank: Improving the FAIRness of Structure Data
title_full Annotating Macromolecular Complexes in the Protein Data Bank: Improving the FAIRness of Structure Data
title_fullStr Annotating Macromolecular Complexes in the Protein Data Bank: Improving the FAIRness of Structure Data
title_full_unstemmed Annotating Macromolecular Complexes in the Protein Data Bank: Improving the FAIRness of Structure Data
title_short Annotating Macromolecular Complexes in the Protein Data Bank: Improving the FAIRness of Structure Data
title_sort annotating macromolecular complexes in the protein data bank: improving the fairness of structure data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10692154/
https://www.ncbi.nlm.nih.gov/pubmed/38040737
http://dx.doi.org/10.1038/s41597-023-02778-9
work_keys_str_mv AT appasamysridevan annotatingmacromolecularcomplexesintheproteindatabankimprovingthefairnessofstructuredata
AT berrisfordjohn annotatingmacromolecularcomplexesintheproteindatabankimprovingthefairnessofstructuredata
AT gaborovaromana annotatingmacromolecularcomplexesintheproteindatabankimprovingthefairnessofstructuredata
AT nairsreenath annotatingmacromolecularcomplexesintheproteindatabankimprovingthefairnessofstructuredata
AT anyangostephen annotatingmacromolecularcomplexesintheproteindatabankimprovingthefairnessofstructuredata
AT grudininsergei annotatingmacromolecularcomplexesintheproteindatabankimprovingthefairnessofstructuredata
AT deshpandemandar annotatingmacromolecularcomplexesintheproteindatabankimprovingthefairnessofstructuredata
AT armstrongdavid annotatingmacromolecularcomplexesintheproteindatabankimprovingthefairnessofstructuredata
AT pidruchnaivanna annotatingmacromolecularcomplexesintheproteindatabankimprovingthefairnessofstructuredata
AT ellawayjosephij annotatingmacromolecularcomplexesintheproteindatabankimprovingthefairnessofstructuredata
AT leinesgriselldiaz annotatingmacromolecularcomplexesintheproteindatabankimprovingthefairnessofstructuredata
AT guptadeepti annotatingmacromolecularcomplexesintheproteindatabankimprovingthefairnessofstructuredata
AT harrusdeborah annotatingmacromolecularcomplexesintheproteindatabankimprovingthefairnessofstructuredata
AT varadimihaly annotatingmacromolecularcomplexesintheproteindatabankimprovingthefairnessofstructuredata
AT velankarsameer annotatingmacromolecularcomplexesintheproteindatabankimprovingthefairnessofstructuredata