Cargando…

PDB-wide identification of physiological hetero-oligomeric assemblies based on conserved quaternary structure geometry

An accurate understanding of biomolecular mechanisms and diseases requires information on protein quaternary structure (QS). A critical challenge in inferring QS information from crystallography data is distinguishing biological interfaces from fortuitous crystal-packing contacts. Here, we employ QS...

Descripción completa

Detalles Bibliográficos
Autores principales: Dey, Sucharita, Levy, Emmanuel D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cell Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8575123/
https://www.ncbi.nlm.nih.gov/pubmed/34520740
http://dx.doi.org/10.1016/j.str.2021.07.012
_version_ 1784595612525658112
author Dey, Sucharita
Levy, Emmanuel D.
author_facet Dey, Sucharita
Levy, Emmanuel D.
author_sort Dey, Sucharita
collection PubMed
description An accurate understanding of biomolecular mechanisms and diseases requires information on protein quaternary structure (QS). A critical challenge in inferring QS information from crystallography data is distinguishing biological interfaces from fortuitous crystal-packing contacts. Here, we employ QS conservation across homologs to infer the biological relevance of hetero-oligomers. We compare the structures and compositions of hetero-oligomers, which allow us to annotate 7,810 complexes as physiologically relevant, 1,060 as likely errors, and 1,432 with comparative information on subunit stoichiometry and composition. Excluding immunoglobulins, these annotations encompass over 51% of hetero-oligomers in the PDB. We curate a dataset of 577 hetero-oligomeric complexes to benchmark these annotations, which reveals an accuracy >94%. When homology information is not available, we compare QS across repositories (PDB, PISA, and EPPIC) to derive confidence estimates. This work provides high-quality annotations along with a large benchmark dataset of hetero-assemblies.
format Online
Article
Text
id pubmed-8575123
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cell Press
record_format MEDLINE/PubMed
spelling pubmed-85751232021-11-10 PDB-wide identification of physiological hetero-oligomeric assemblies based on conserved quaternary structure geometry Dey, Sucharita Levy, Emmanuel D. Structure Resource An accurate understanding of biomolecular mechanisms and diseases requires information on protein quaternary structure (QS). A critical challenge in inferring QS information from crystallography data is distinguishing biological interfaces from fortuitous crystal-packing contacts. Here, we employ QS conservation across homologs to infer the biological relevance of hetero-oligomers. We compare the structures and compositions of hetero-oligomers, which allow us to annotate 7,810 complexes as physiologically relevant, 1,060 as likely errors, and 1,432 with comparative information on subunit stoichiometry and composition. Excluding immunoglobulins, these annotations encompass over 51% of hetero-oligomers in the PDB. We curate a dataset of 577 hetero-oligomeric complexes to benchmark these annotations, which reveals an accuracy >94%. When homology information is not available, we compare QS across repositories (PDB, PISA, and EPPIC) to derive confidence estimates. This work provides high-quality annotations along with a large benchmark dataset of hetero-assemblies. Cell Press 2021-11-04 /pmc/articles/PMC8575123/ /pubmed/34520740 http://dx.doi.org/10.1016/j.str.2021.07.012 Text en © 2021 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Resource
Dey, Sucharita
Levy, Emmanuel D.
PDB-wide identification of physiological hetero-oligomeric assemblies based on conserved quaternary structure geometry
title PDB-wide identification of physiological hetero-oligomeric assemblies based on conserved quaternary structure geometry
title_full PDB-wide identification of physiological hetero-oligomeric assemblies based on conserved quaternary structure geometry
title_fullStr PDB-wide identification of physiological hetero-oligomeric assemblies based on conserved quaternary structure geometry
title_full_unstemmed PDB-wide identification of physiological hetero-oligomeric assemblies based on conserved quaternary structure geometry
title_short PDB-wide identification of physiological hetero-oligomeric assemblies based on conserved quaternary structure geometry
title_sort pdb-wide identification of physiological hetero-oligomeric assemblies based on conserved quaternary structure geometry
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8575123/
https://www.ncbi.nlm.nih.gov/pubmed/34520740
http://dx.doi.org/10.1016/j.str.2021.07.012
work_keys_str_mv AT deysucharita pdbwideidentificationofphysiologicalheterooligomericassembliesbasedonconservedquaternarystructuregeometry
AT levyemmanueld pdbwideidentificationofphysiologicalheterooligomericassembliesbasedonconservedquaternarystructuregeometry