Cargando…

Investigation of protein quaternary structure via stoichiometry and symmetry ınformation

The Protein Data Bank (PDB) is the single worldwide archive of experimentally-determined three-dimensional (3D) structures of proteins and nucleic acids. As of January 2017, the PDB housed more than 125,000 structures and was growing by more than 11,000 structures annually. Since the 3D structure of...

Descripción completa

Detalles Bibliográficos
Autores principales:	Korkmaz, Selcuk, Duarte, Jose M., Prlić, Andreas, Goksuluk, Dincer, Zararsiz, Gokmen, Saracbasi, Osman, Burley, Stephen K., Rose, Peter W.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2018
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5986128/ https://www.ncbi.nlm.nih.gov/pubmed/29864163 http://dx.doi.org/10.1371/journal.pone.0197176

_version_	1783328874511204352
author	Korkmaz, Selcuk Duarte, Jose M. Prlić, Andreas Goksuluk, Dincer Zararsiz, Gokmen Saracbasi, Osman Burley, Stephen K. Rose, Peter W.
author_facet	Korkmaz, Selcuk Duarte, Jose M. Prlić, Andreas Goksuluk, Dincer Zararsiz, Gokmen Saracbasi, Osman Burley, Stephen K. Rose, Peter W.
author_sort	Korkmaz, Selcuk
collection	PubMed
description	The Protein Data Bank (PDB) is the single worldwide archive of experimentally-determined three-dimensional (3D) structures of proteins and nucleic acids. As of January 2017, the PDB housed more than 125,000 structures and was growing by more than 11,000 structures annually. Since the 3D structure of a protein is vital to understand the mechanisms of biological processes, diseases, and drug design, correct oligomeric assembly information is of critical importance. Unfortunately, the biologically relevant oligomeric form of a 3D structure is not directly obtainable by X-ray crystallography, whilst in solution methods (NMR or single particle EM) it is known from the experiment. Instead, this information may be provided by the PDB Depositor as metadata coming from additional experiments, be inferred by sequence-sequence comparisons with similar proteins of known oligomeric state, or predicted using software, such as PISA (Proteins, Interfaces, Structures and Assemblies) or EPPIC (Evolutionary Protein Protein Interface Classifier). Despite significant efforts by professional PDB Biocurators during data deposition, there remain a number of structures in the archive with incorrect quaternary structure descriptions (or annotations). Further investigation is, therefore, needed to evaluate the correctness of quaternary structure annotations. In this study, we aim to identify the most probable oligomeric states for proteins represented in the PDB. Our approach evaluated the performance of four independent prediction methods, including text mining of primary publications, inference from homologous protein structures, and two computational methods (PISA and EPPIC). Aggregating predictions to give consensus results outperformed all four of the independent prediction methods, yielding 83% correct, 9% wrong, and 8% inconclusive predictions, when tested with a well-curated benchmark dataset. We have developed a freely-available web-based tool to make this approach accessible to researchers and PDB Biocurators (http://quatstruct.rcsb.org/).
format	Online Article Text
id	pubmed-5986128
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-59861282018-06-16 Investigation of protein quaternary structure via stoichiometry and symmetry ınformation Korkmaz, Selcuk Duarte, Jose M. Prlić, Andreas Goksuluk, Dincer Zararsiz, Gokmen Saracbasi, Osman Burley, Stephen K. Rose, Peter W. PLoS One Research Article The Protein Data Bank (PDB) is the single worldwide archive of experimentally-determined three-dimensional (3D) structures of proteins and nucleic acids. As of January 2017, the PDB housed more than 125,000 structures and was growing by more than 11,000 structures annually. Since the 3D structure of a protein is vital to understand the mechanisms of biological processes, diseases, and drug design, correct oligomeric assembly information is of critical importance. Unfortunately, the biologically relevant oligomeric form of a 3D structure is not directly obtainable by X-ray crystallography, whilst in solution methods (NMR or single particle EM) it is known from the experiment. Instead, this information may be provided by the PDB Depositor as metadata coming from additional experiments, be inferred by sequence-sequence comparisons with similar proteins of known oligomeric state, or predicted using software, such as PISA (Proteins, Interfaces, Structures and Assemblies) or EPPIC (Evolutionary Protein Protein Interface Classifier). Despite significant efforts by professional PDB Biocurators during data deposition, there remain a number of structures in the archive with incorrect quaternary structure descriptions (or annotations). Further investigation is, therefore, needed to evaluate the correctness of quaternary structure annotations. In this study, we aim to identify the most probable oligomeric states for proteins represented in the PDB. Our approach evaluated the performance of four independent prediction methods, including text mining of primary publications, inference from homologous protein structures, and two computational methods (PISA and EPPIC). Aggregating predictions to give consensus results outperformed all four of the independent prediction methods, yielding 83% correct, 9% wrong, and 8% inconclusive predictions, when tested with a well-curated benchmark dataset. We have developed a freely-available web-based tool to make this approach accessible to researchers and PDB Biocurators (http://quatstruct.rcsb.org/). Public Library of Science 2018-06-04 /pmc/articles/PMC5986128/ /pubmed/29864163 http://dx.doi.org/10.1371/journal.pone.0197176 Text en © 2018 Korkmaz et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Korkmaz, Selcuk Duarte, Jose M. Prlić, Andreas Goksuluk, Dincer Zararsiz, Gokmen Saracbasi, Osman Burley, Stephen K. Rose, Peter W. Investigation of protein quaternary structure via stoichiometry and symmetry ınformation
title	Investigation of protein quaternary structure via stoichiometry and symmetry ınformation
title_full	Investigation of protein quaternary structure via stoichiometry and symmetry ınformation
title_fullStr	Investigation of protein quaternary structure via stoichiometry and symmetry ınformation
title_full_unstemmed	Investigation of protein quaternary structure via stoichiometry and symmetry ınformation
title_short	Investigation of protein quaternary structure via stoichiometry and symmetry ınformation
title_sort	investigation of protein quaternary structure via stoichiometry and symmetry ınformation
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5986128/ https://www.ncbi.nlm.nih.gov/pubmed/29864163 http://dx.doi.org/10.1371/journal.pone.0197176
work_keys_str_mv	AT korkmazselcuk investigationofproteinquaternarystructureviastoichiometryandsymmetryınformation AT duartejosem investigationofproteinquaternarystructureviastoichiometryandsymmetryınformation AT prlicandreas investigationofproteinquaternarystructureviastoichiometryandsymmetryınformation AT goksulukdincer investigationofproteinquaternarystructureviastoichiometryandsymmetryınformation AT zararsizgokmen investigationofproteinquaternarystructureviastoichiometryandsymmetryınformation AT saracbasiosman investigationofproteinquaternarystructureviastoichiometryandsymmetryınformation AT burleystephenk investigationofproteinquaternarystructureviastoichiometryandsymmetryınformation AT rosepeterw investigationofproteinquaternarystructureviastoichiometryandsymmetryınformation

Investigation of protein quaternary structure via stoichiometry and symmetry ınformation

Ejemplares similares