Cargando…

Bayesian semiparametric regression models to characterize molecular evolution

BACKGROUND: Statistical models and methods that associate changes in the physicochemical properties of amino acids with natural selection at the molecular level typically do not take into account the correlations between such properties. We propose a Bayesian hierarchical regression model with a gen...

Descripción completa

Detalles Bibliográficos
Autores principales: Datta, Saheli, Rodriguez, Abel, Prado, Raquel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3577475/
https://www.ncbi.nlm.nih.gov/pubmed/23107360
http://dx.doi.org/10.1186/1471-2105-13-278
_version_ 1782259919864463360
author Datta, Saheli
Rodriguez, Abel
Prado, Raquel
author_facet Datta, Saheli
Rodriguez, Abel
Prado, Raquel
author_sort Datta, Saheli
collection PubMed
description BACKGROUND: Statistical models and methods that associate changes in the physicochemical properties of amino acids with natural selection at the molecular level typically do not take into account the correlations between such properties. We propose a Bayesian hierarchical regression model with a generalization of the Dirichlet process prior on the distribution of the regression coefficients that describes the relationship between the changes in amino acid distances and natural selection in protein-coding DNA sequence alignments. RESULTS: The Bayesian semiparametric approach is illustrated with simulated data and the abalone lysin sperm data. Our method identifies groups of properties which, for this particular dataset, have a similar effect on evolution. The model also provides nonparametric site-specific estimates for the strength of conservation of these properties. CONCLUSIONS: The model described here is distinguished by its ability to handle a large number of amino acid properties simultaneously, while taking into account that such data can be correlated. The multi-level clustering ability of the model allows for appealing interpretations of the results in terms of properties that are roughly equivalent from the standpoint of molecular evolution.
format Online
Article
Text
id pubmed-3577475
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35774752013-02-26 Bayesian semiparametric regression models to characterize molecular evolution Datta, Saheli Rodriguez, Abel Prado, Raquel BMC Bioinformatics Research Article BACKGROUND: Statistical models and methods that associate changes in the physicochemical properties of amino acids with natural selection at the molecular level typically do not take into account the correlations between such properties. We propose a Bayesian hierarchical regression model with a generalization of the Dirichlet process prior on the distribution of the regression coefficients that describes the relationship between the changes in amino acid distances and natural selection in protein-coding DNA sequence alignments. RESULTS: The Bayesian semiparametric approach is illustrated with simulated data and the abalone lysin sperm data. Our method identifies groups of properties which, for this particular dataset, have a similar effect on evolution. The model also provides nonparametric site-specific estimates for the strength of conservation of these properties. CONCLUSIONS: The model described here is distinguished by its ability to handle a large number of amino acid properties simultaneously, while taking into account that such data can be correlated. The multi-level clustering ability of the model allows for appealing interpretations of the results in terms of properties that are roughly equivalent from the standpoint of molecular evolution. BioMed Central 2012-10-30 /pmc/articles/PMC3577475/ /pubmed/23107360 http://dx.doi.org/10.1186/1471-2105-13-278 Text en Copyright ©2012 Datta et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Datta, Saheli
Rodriguez, Abel
Prado, Raquel
Bayesian semiparametric regression models to characterize molecular evolution
title Bayesian semiparametric regression models to characterize molecular evolution
title_full Bayesian semiparametric regression models to characterize molecular evolution
title_fullStr Bayesian semiparametric regression models to characterize molecular evolution
title_full_unstemmed Bayesian semiparametric regression models to characterize molecular evolution
title_short Bayesian semiparametric regression models to characterize molecular evolution
title_sort bayesian semiparametric regression models to characterize molecular evolution
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3577475/
https://www.ncbi.nlm.nih.gov/pubmed/23107360
http://dx.doi.org/10.1186/1471-2105-13-278
work_keys_str_mv AT dattasaheli bayesiansemiparametricregressionmodelstocharacterizemolecularevolution
AT rodriguezabel bayesiansemiparametricregressionmodelstocharacterizemolecularevolution
AT pradoraquel bayesiansemiparametricregressionmodelstocharacterizemolecularevolution