Cargando…
Bayesian semiparametric regression models to characterize molecular evolution
BACKGROUND: Statistical models and methods that associate changes in the physicochemical properties of amino acids with natural selection at the molecular level typically do not take into account the correlations between such properties. We propose a Bayesian hierarchical regression model with a gen...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3577475/ https://www.ncbi.nlm.nih.gov/pubmed/23107360 http://dx.doi.org/10.1186/1471-2105-13-278 |
_version_ | 1782259919864463360 |
---|---|
author | Datta, Saheli Rodriguez, Abel Prado, Raquel |
author_facet | Datta, Saheli Rodriguez, Abel Prado, Raquel |
author_sort | Datta, Saheli |
collection | PubMed |
description | BACKGROUND: Statistical models and methods that associate changes in the physicochemical properties of amino acids with natural selection at the molecular level typically do not take into account the correlations between such properties. We propose a Bayesian hierarchical regression model with a generalization of the Dirichlet process prior on the distribution of the regression coefficients that describes the relationship between the changes in amino acid distances and natural selection in protein-coding DNA sequence alignments. RESULTS: The Bayesian semiparametric approach is illustrated with simulated data and the abalone lysin sperm data. Our method identifies groups of properties which, for this particular dataset, have a similar effect on evolution. The model also provides nonparametric site-specific estimates for the strength of conservation of these properties. CONCLUSIONS: The model described here is distinguished by its ability to handle a large number of amino acid properties simultaneously, while taking into account that such data can be correlated. The multi-level clustering ability of the model allows for appealing interpretations of the results in terms of properties that are roughly equivalent from the standpoint of molecular evolution. |
format | Online Article Text |
id | pubmed-3577475 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35774752013-02-26 Bayesian semiparametric regression models to characterize molecular evolution Datta, Saheli Rodriguez, Abel Prado, Raquel BMC Bioinformatics Research Article BACKGROUND: Statistical models and methods that associate changes in the physicochemical properties of amino acids with natural selection at the molecular level typically do not take into account the correlations between such properties. We propose a Bayesian hierarchical regression model with a generalization of the Dirichlet process prior on the distribution of the regression coefficients that describes the relationship between the changes in amino acid distances and natural selection in protein-coding DNA sequence alignments. RESULTS: The Bayesian semiparametric approach is illustrated with simulated data and the abalone lysin sperm data. Our method identifies groups of properties which, for this particular dataset, have a similar effect on evolution. The model also provides nonparametric site-specific estimates for the strength of conservation of these properties. CONCLUSIONS: The model described here is distinguished by its ability to handle a large number of amino acid properties simultaneously, while taking into account that such data can be correlated. The multi-level clustering ability of the model allows for appealing interpretations of the results in terms of properties that are roughly equivalent from the standpoint of molecular evolution. BioMed Central 2012-10-30 /pmc/articles/PMC3577475/ /pubmed/23107360 http://dx.doi.org/10.1186/1471-2105-13-278 Text en Copyright ©2012 Datta et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Datta, Saheli Rodriguez, Abel Prado, Raquel Bayesian semiparametric regression models to characterize molecular evolution |
title | Bayesian semiparametric regression models to characterize molecular evolution |
title_full | Bayesian semiparametric regression models to characterize molecular evolution |
title_fullStr | Bayesian semiparametric regression models to characterize molecular evolution |
title_full_unstemmed | Bayesian semiparametric regression models to characterize molecular evolution |
title_short | Bayesian semiparametric regression models to characterize molecular evolution |
title_sort | bayesian semiparametric regression models to characterize molecular evolution |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3577475/ https://www.ncbi.nlm.nih.gov/pubmed/23107360 http://dx.doi.org/10.1186/1471-2105-13-278 |
work_keys_str_mv | AT dattasaheli bayesiansemiparametricregressionmodelstocharacterizemolecularevolution AT rodriguezabel bayesiansemiparametricregressionmodelstocharacterizemolecularevolution AT pradoraquel bayesiansemiparametricregressionmodelstocharacterizemolecularevolution |