Cargando…

Mining the Protein Data Bank to Differentiate Error from Structural Variation in Clustered Static Structures: An Examination of HIV Protease

The Protein Data Bank (PDB) contains over 71,000 structures. Extensively studied proteins have hundreds of submissions available, including mutations, different complexes, and space groups, allowing for application of data-mining algorithms to analyze an array of static structures and gain insight a...

Descripción completa

Detalles Bibliográficos
Autores principales: Venkatakrishnan, Balasubramanian, Palii, Miorel-Lucian, Agbandje-McKenna, Mavis, McKenna, Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3347031/
https://www.ncbi.nlm.nih.gov/pubmed/22590675
http://dx.doi.org/10.3390/v4030348
_version_ 1782232267284807680
author Venkatakrishnan, Balasubramanian
Palii, Miorel-Lucian
Agbandje-McKenna, Mavis
McKenna, Robert
author_facet Venkatakrishnan, Balasubramanian
Palii, Miorel-Lucian
Agbandje-McKenna, Mavis
McKenna, Robert
author_sort Venkatakrishnan, Balasubramanian
collection PubMed
description The Protein Data Bank (PDB) contains over 71,000 structures. Extensively studied proteins have hundreds of submissions available, including mutations, different complexes, and space groups, allowing for application of data-mining algorithms to analyze an array of static structures and gain insight about a protein’s structural variation and possibly its dynamics. This investigation is a case study of HIV protease (PR) using in-house algorithms for data mining and structure superposition through generalized formulæ that account for multiple conformations and fractional occupancies. Temperature factors (B-factors) are compared with spatial displacement from the mean structure over the entire study set and separately over bound and ligand-free structures, to assess the significance of structural deviation in a statistical context. Space group differences are also examined.
format Online
Article
Text
id pubmed-3347031
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-33470312012-05-15 Mining the Protein Data Bank to Differentiate Error from Structural Variation in Clustered Static Structures: An Examination of HIV Protease Venkatakrishnan, Balasubramanian Palii, Miorel-Lucian Agbandje-McKenna, Mavis McKenna, Robert Viruses Article The Protein Data Bank (PDB) contains over 71,000 structures. Extensively studied proteins have hundreds of submissions available, including mutations, different complexes, and space groups, allowing for application of data-mining algorithms to analyze an array of static structures and gain insight about a protein’s structural variation and possibly its dynamics. This investigation is a case study of HIV protease (PR) using in-house algorithms for data mining and structure superposition through generalized formulæ that account for multiple conformations and fractional occupancies. Temperature factors (B-factors) are compared with spatial displacement from the mean structure over the entire study set and separately over bound and ligand-free structures, to assess the significance of structural deviation in a statistical context. Space group differences are also examined. MDPI 2012-03-05 /pmc/articles/PMC3347031/ /pubmed/22590675 http://dx.doi.org/10.3390/v4030348 Text en © 2012 by the authors; licensee MDPI, Basel, Switzerland. http://creativecommons.org/licenses/by/3.0/ This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Article
Venkatakrishnan, Balasubramanian
Palii, Miorel-Lucian
Agbandje-McKenna, Mavis
McKenna, Robert
Mining the Protein Data Bank to Differentiate Error from Structural Variation in Clustered Static Structures: An Examination of HIV Protease
title Mining the Protein Data Bank to Differentiate Error from Structural Variation in Clustered Static Structures: An Examination of HIV Protease
title_full Mining the Protein Data Bank to Differentiate Error from Structural Variation in Clustered Static Structures: An Examination of HIV Protease
title_fullStr Mining the Protein Data Bank to Differentiate Error from Structural Variation in Clustered Static Structures: An Examination of HIV Protease
title_full_unstemmed Mining the Protein Data Bank to Differentiate Error from Structural Variation in Clustered Static Structures: An Examination of HIV Protease
title_short Mining the Protein Data Bank to Differentiate Error from Structural Variation in Clustered Static Structures: An Examination of HIV Protease
title_sort mining the protein data bank to differentiate error from structural variation in clustered static structures: an examination of hiv protease
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3347031/
https://www.ncbi.nlm.nih.gov/pubmed/22590675
http://dx.doi.org/10.3390/v4030348
work_keys_str_mv AT venkatakrishnanbalasubramanian miningtheproteindatabanktodifferentiateerrorfromstructuralvariationinclusteredstaticstructuresanexaminationofhivprotease
AT paliimiorellucian miningtheproteindatabanktodifferentiateerrorfromstructuralvariationinclusteredstaticstructuresanexaminationofhivprotease
AT agbandjemckennamavis miningtheproteindatabanktodifferentiateerrorfromstructuralvariationinclusteredstaticstructuresanexaminationofhivprotease
AT mckennarobert miningtheproteindatabanktodifferentiateerrorfromstructuralvariationinclusteredstaticstructuresanexaminationofhivprotease