Cargando…

Context dependent reference states of solvent accessibility derived from native protein structures and assessed by predictability analysis

BACKGROUND: Solvent accessibility (ASA) of amino acid residues is often transformed from absolute values of exposed surface area to their normalized relative values. This normalization is typically attained by assuming a highest exposure conformation based on extended state of that residue when it i...

Descripción completa

Detalles Bibliográficos
Autores principales: Singh, Hemajit, Ahmad, Shandar
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2685369/
https://www.ncbi.nlm.nih.gov/pubmed/19397821
http://dx.doi.org/10.1186/1472-6807-9-25
_version_ 1782167314347589632
author Singh, Hemajit
Ahmad, Shandar
author_facet Singh, Hemajit
Ahmad, Shandar
author_sort Singh, Hemajit
collection PubMed
description BACKGROUND: Solvent accessibility (ASA) of amino acid residues is often transformed from absolute values of exposed surface area to their normalized relative values. This normalization is typically attained by assuming a highest exposure conformation based on extended state of that residue when it is surrounded by Ala or Gly on both sides i.e. Ala-X-Ala or Gly-X-Gly solvent exposed area. Exact sequence context, the folding state of the residues, and the actual environment of a folded protein, which do impose additional constraints on the highest possible (or highest observed) values of ASA, are currently ignored. Here, we analyze the statistics of these constraints and examine how the normalization of absolute ASA values using context-dependent Highest Observed ASA (HOA) instead of context-free extended state ASA (ESA) of residues can influence the performance of sequence-based prediction of solvent accessibility. Characterization of burial and exposed states of residues based on this normalization has also been shown to provide better enrichment of DNA-binding sites in exposed residues. RESULTS: We compiled the statistics of highest observed ASA (HOA) of residues in their different contexts and analyzed their distribution in all 400 possible combinations for each residue type. We observe that many trippetides are more exposed than ESA and that HOA residues are often found in turn, coil and bend conformations. On the other hand several residues are never observed in an exposure state close to ESA values. A neural networks trained with HOA-normalized data outperforms the one trained with ESA-normalized values. However, the improvements are subtle in some residues, while they are more significant in others. CONCLUSION: HOA based normalization of solvent accessibility from native structures is proposed and it shows improvement in sequence-based predictability, as well as enrichment in interface residues on surface. There may still be some difference between the highest possible ASA and highest observed ASA due to an insufficiently covered space of ASA distribution in the PDB, which limit the overall improvement in prediction to a relatively modest degree.
format Text
id pubmed-2685369
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26853692009-05-22 Context dependent reference states of solvent accessibility derived from native protein structures and assessed by predictability analysis Singh, Hemajit Ahmad, Shandar BMC Struct Biol Research Article BACKGROUND: Solvent accessibility (ASA) of amino acid residues is often transformed from absolute values of exposed surface area to their normalized relative values. This normalization is typically attained by assuming a highest exposure conformation based on extended state of that residue when it is surrounded by Ala or Gly on both sides i.e. Ala-X-Ala or Gly-X-Gly solvent exposed area. Exact sequence context, the folding state of the residues, and the actual environment of a folded protein, which do impose additional constraints on the highest possible (or highest observed) values of ASA, are currently ignored. Here, we analyze the statistics of these constraints and examine how the normalization of absolute ASA values using context-dependent Highest Observed ASA (HOA) instead of context-free extended state ASA (ESA) of residues can influence the performance of sequence-based prediction of solvent accessibility. Characterization of burial and exposed states of residues based on this normalization has also been shown to provide better enrichment of DNA-binding sites in exposed residues. RESULTS: We compiled the statistics of highest observed ASA (HOA) of residues in their different contexts and analyzed their distribution in all 400 possible combinations for each residue type. We observe that many trippetides are more exposed than ESA and that HOA residues are often found in turn, coil and bend conformations. On the other hand several residues are never observed in an exposure state close to ESA values. A neural networks trained with HOA-normalized data outperforms the one trained with ESA-normalized values. However, the improvements are subtle in some residues, while they are more significant in others. CONCLUSION: HOA based normalization of solvent accessibility from native structures is proposed and it shows improvement in sequence-based predictability, as well as enrichment in interface residues on surface. There may still be some difference between the highest possible ASA and highest observed ASA due to an insufficiently covered space of ASA distribution in the PDB, which limit the overall improvement in prediction to a relatively modest degree. BioMed Central 2009-04-27 /pmc/articles/PMC2685369/ /pubmed/19397821 http://dx.doi.org/10.1186/1472-6807-9-25 Text en Copyright © 2009 Singh and Ahmad; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Singh, Hemajit
Ahmad, Shandar
Context dependent reference states of solvent accessibility derived from native protein structures and assessed by predictability analysis
title Context dependent reference states of solvent accessibility derived from native protein structures and assessed by predictability analysis
title_full Context dependent reference states of solvent accessibility derived from native protein structures and assessed by predictability analysis
title_fullStr Context dependent reference states of solvent accessibility derived from native protein structures and assessed by predictability analysis
title_full_unstemmed Context dependent reference states of solvent accessibility derived from native protein structures and assessed by predictability analysis
title_short Context dependent reference states of solvent accessibility derived from native protein structures and assessed by predictability analysis
title_sort context dependent reference states of solvent accessibility derived from native protein structures and assessed by predictability analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2685369/
https://www.ncbi.nlm.nih.gov/pubmed/19397821
http://dx.doi.org/10.1186/1472-6807-9-25
work_keys_str_mv AT singhhemajit contextdependentreferencestatesofsolventaccessibilityderivedfromnativeproteinstructuresandassessedbypredictabilityanalysis
AT ahmadshandar contextdependentreferencestatesofsolventaccessibilityderivedfromnativeproteinstructuresandassessedbypredictabilityanalysis