Cargando…

Real time structural search of the Protein Data Bank

Detection of protein structure similarity is a central challenge in structural bioinformatics. Comparisons are usually performed at the polypeptide chain level, however the functional form of a protein within the cell is often an oligomer. This fact, together with recent growth of oligomeric structu...

Descripción completa

Detalles Bibliográficos
Autores principales: Guzenko, Dmytro, Burley, Stephen K., Duarte, Jose M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7371193/
https://www.ncbi.nlm.nih.gov/pubmed/32639954
http://dx.doi.org/10.1371/journal.pcbi.1007970
_version_ 1783561099105271808
author Guzenko, Dmytro
Burley, Stephen K.
Duarte, Jose M.
author_facet Guzenko, Dmytro
Burley, Stephen K.
Duarte, Jose M.
author_sort Guzenko, Dmytro
collection PubMed
description Detection of protein structure similarity is a central challenge in structural bioinformatics. Comparisons are usually performed at the polypeptide chain level, however the functional form of a protein within the cell is often an oligomer. This fact, together with recent growth of oligomeric structures in the Protein Data Bank (PDB), demands more efficient approaches to oligomeric assembly alignment/retrieval. Traditional methods use atom level information, which can be complicated by the presence of topological permutations within a polypeptide chain and/or subunit rearrangements. These challenges can be overcome by comparing electron density volumes directly. But, brute force alignment of 3D data is a compute intensive search problem. We developed a 3D Zernike moment normalization procedure to orient electron density volumes and assess similarity with unprecedented speed. Similarity searching with this approach enables real-time retrieval of proteins/protein assemblies resembling a target, from PDB or user input, together with resulting alignments (http://shape.rcsb.org).
format Online
Article
Text
id pubmed-7371193
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-73711932020-07-29 Real time structural search of the Protein Data Bank Guzenko, Dmytro Burley, Stephen K. Duarte, Jose M. PLoS Comput Biol Research Article Detection of protein structure similarity is a central challenge in structural bioinformatics. Comparisons are usually performed at the polypeptide chain level, however the functional form of a protein within the cell is often an oligomer. This fact, together with recent growth of oligomeric structures in the Protein Data Bank (PDB), demands more efficient approaches to oligomeric assembly alignment/retrieval. Traditional methods use atom level information, which can be complicated by the presence of topological permutations within a polypeptide chain and/or subunit rearrangements. These challenges can be overcome by comparing electron density volumes directly. But, brute force alignment of 3D data is a compute intensive search problem. We developed a 3D Zernike moment normalization procedure to orient electron density volumes and assess similarity with unprecedented speed. Similarity searching with this approach enables real-time retrieval of proteins/protein assemblies resembling a target, from PDB or user input, together with resulting alignments (http://shape.rcsb.org). Public Library of Science 2020-07-08 /pmc/articles/PMC7371193/ /pubmed/32639954 http://dx.doi.org/10.1371/journal.pcbi.1007970 Text en © 2020 Guzenko et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Guzenko, Dmytro
Burley, Stephen K.
Duarte, Jose M.
Real time structural search of the Protein Data Bank
title Real time structural search of the Protein Data Bank
title_full Real time structural search of the Protein Data Bank
title_fullStr Real time structural search of the Protein Data Bank
title_full_unstemmed Real time structural search of the Protein Data Bank
title_short Real time structural search of the Protein Data Bank
title_sort real time structural search of the protein data bank
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7371193/
https://www.ncbi.nlm.nih.gov/pubmed/32639954
http://dx.doi.org/10.1371/journal.pcbi.1007970
work_keys_str_mv AT guzenkodmytro realtimestructuralsearchoftheproteindatabank
AT burleystephenk realtimestructuralsearchoftheproteindatabank
AT duartejosem realtimestructuralsearchoftheproteindatabank