Cargando…

Inference of Functionally-Relevant N-acetyltransferase Residues Based on Statistical Correlations

Over evolutionary time, members of a superfamily of homologous proteins sharing a common structural core diverge into subgroups filling various functional niches. At the sequence level, such divergence appears as correlations that arise from residue patterns distinct to each subgroup. Such a superfa...

Descripción completa

Detalles Bibliográficos
Autores principales: Neuwald, Andrew F., Altschul, Stephen F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5225019/
https://www.ncbi.nlm.nih.gov/pubmed/28002465
http://dx.doi.org/10.1371/journal.pcbi.1005294
_version_ 1782493449707061248
author Neuwald, Andrew F.
Altschul, Stephen F.
author_facet Neuwald, Andrew F.
Altschul, Stephen F.
author_sort Neuwald, Andrew F.
collection PubMed
description Over evolutionary time, members of a superfamily of homologous proteins sharing a common structural core diverge into subgroups filling various functional niches. At the sequence level, such divergence appears as correlations that arise from residue patterns distinct to each subgroup. Such a superfamily may be viewed as a population of sequences corresponding to a complex, high-dimensional probability distribution. Here we model this distribution as hierarchical interrelated hidden Markov models (hiHMMs), which describe these sequence correlations implicitly. By characterizing such correlations one may hope to obtain information regarding functionally-relevant properties that have thus far evaded detection. To do so, we infer a hiHMM distribution from sequence data using Bayes’ theorem and Markov chain Monte Carlo (MCMC) sampling, which is widely recognized as the most effective approach for characterizing a complex, high dimensional distribution. Other routines then map correlated residue patterns to available structures with a view to hypothesis generation. When applied to N-acetyltransferases, this reveals sequence and structural features indicative of functionally important, yet generally unknown biochemical properties. Even for sets of proteins for which nothing is known beyond unannotated sequences and structures, this can lead to helpful insights. We describe, for example, a putative coenzyme-A-induced-fit substrate binding mechanism mediated by arginine residue switching between salt bridge and π-π stacking interactions. A suite of programs implementing this approach is available (psed.igs.umaryland.edu).
format Online
Article
Text
id pubmed-5225019
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-52250192017-01-25 Inference of Functionally-Relevant N-acetyltransferase Residues Based on Statistical Correlations Neuwald, Andrew F. Altschul, Stephen F. PLoS Comput Biol Research Article Over evolutionary time, members of a superfamily of homologous proteins sharing a common structural core diverge into subgroups filling various functional niches. At the sequence level, such divergence appears as correlations that arise from residue patterns distinct to each subgroup. Such a superfamily may be viewed as a population of sequences corresponding to a complex, high-dimensional probability distribution. Here we model this distribution as hierarchical interrelated hidden Markov models (hiHMMs), which describe these sequence correlations implicitly. By characterizing such correlations one may hope to obtain information regarding functionally-relevant properties that have thus far evaded detection. To do so, we infer a hiHMM distribution from sequence data using Bayes’ theorem and Markov chain Monte Carlo (MCMC) sampling, which is widely recognized as the most effective approach for characterizing a complex, high dimensional distribution. Other routines then map correlated residue patterns to available structures with a view to hypothesis generation. When applied to N-acetyltransferases, this reveals sequence and structural features indicative of functionally important, yet generally unknown biochemical properties. Even for sets of proteins for which nothing is known beyond unannotated sequences and structures, this can lead to helpful insights. We describe, for example, a putative coenzyme-A-induced-fit substrate binding mechanism mediated by arginine residue switching between salt bridge and π-π stacking interactions. A suite of programs implementing this approach is available (psed.igs.umaryland.edu). Public Library of Science 2016-12-21 /pmc/articles/PMC5225019/ /pubmed/28002465 http://dx.doi.org/10.1371/journal.pcbi.1005294 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Neuwald, Andrew F.
Altschul, Stephen F.
Inference of Functionally-Relevant N-acetyltransferase Residues Based on Statistical Correlations
title Inference of Functionally-Relevant N-acetyltransferase Residues Based on Statistical Correlations
title_full Inference of Functionally-Relevant N-acetyltransferase Residues Based on Statistical Correlations
title_fullStr Inference of Functionally-Relevant N-acetyltransferase Residues Based on Statistical Correlations
title_full_unstemmed Inference of Functionally-Relevant N-acetyltransferase Residues Based on Statistical Correlations
title_short Inference of Functionally-Relevant N-acetyltransferase Residues Based on Statistical Correlations
title_sort inference of functionally-relevant n-acetyltransferase residues based on statistical correlations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5225019/
https://www.ncbi.nlm.nih.gov/pubmed/28002465
http://dx.doi.org/10.1371/journal.pcbi.1005294
work_keys_str_mv AT neuwaldandrewf inferenceoffunctionallyrelevantnacetyltransferaseresiduesbasedonstatisticalcorrelations
AT altschulstephenf inferenceoffunctionallyrelevantnacetyltransferaseresiduesbasedonstatisticalcorrelations