Cargando…

A Bayesian model for gene family evolution

BACKGROUND: A birth and death process is frequently used for modeling the size of a gene family that may vary along the branches of a phylogenetic tree. Under the birth and death model, maximum likelihood methods have been developed to estimate the birth and death rate and the sizes of ancient gene...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Liang, Yu, Lili, Kalavacharla, Venugopal, Liu, Zhanji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3774087/
https://www.ncbi.nlm.nih.gov/pubmed/22044581
http://dx.doi.org/10.1186/1471-2105-12-426
_version_ 1782284458381017088
author Liu, Liang
Yu, Lili
Kalavacharla, Venugopal
Liu, Zhanji
author_facet Liu, Liang
Yu, Lili
Kalavacharla, Venugopal
Liu, Zhanji
author_sort Liu, Liang
collection PubMed
description BACKGROUND: A birth and death process is frequently used for modeling the size of a gene family that may vary along the branches of a phylogenetic tree. Under the birth and death model, maximum likelihood methods have been developed to estimate the birth and death rate and the sizes of ancient gene families (numbers of gene copies at the internodes of the phylogenetic tree). This paper aims to provide a Bayesian approach for estimating parameters in the birth and death model. RESULTS: We develop a Bayesian approach for estimating the birth and death rate and other parameters in the birth and death model. In addition, a Bayesian hypothesis test is developed to identify the gene families that are unlikely under the birth and death process. Simulation results suggest that the Bayesian estimate is more accurate than the maximum likelihood estimate of the birth and death rate. The Bayesian approach was applied to a real dataset of 3517 gene families across genomes of five yeast species. The results indicate that the Bayesian model assuming a constant birth and death rate among branches of the phylogenetic tree cannot adequately explain the observed pattern of the sizes of gene families across species. The yeast dataset was thus analyzed with a Bayesian heterogeneous rate model that allows the birth and death rate to vary among the branches of the tree. The unlikely gene families identified by the Bayesian heterogeneous rate model are different from those given by the maximum likelihood method. CONCLUSIONS: Compared to the maximum likelihood method, the Bayesian approach can produce more accurate estimates of the parameters in the birth and death model. In addition, the Bayesian hypothesis test is able to identify unlikely gene families based on Bayesian posterior p-values. As a powerful statistical technique, the Bayesian approach can effectively extract information from gene family data and thereby provide useful information regarding the evolutionary process of gene families across genomes.
format Online
Article
Text
id pubmed-3774087
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-37740872013-09-17 A Bayesian model for gene family evolution Liu, Liang Yu, Lili Kalavacharla, Venugopal Liu, Zhanji BMC Bioinformatics Methodology Article BACKGROUND: A birth and death process is frequently used for modeling the size of a gene family that may vary along the branches of a phylogenetic tree. Under the birth and death model, maximum likelihood methods have been developed to estimate the birth and death rate and the sizes of ancient gene families (numbers of gene copies at the internodes of the phylogenetic tree). This paper aims to provide a Bayesian approach for estimating parameters in the birth and death model. RESULTS: We develop a Bayesian approach for estimating the birth and death rate and other parameters in the birth and death model. In addition, a Bayesian hypothesis test is developed to identify the gene families that are unlikely under the birth and death process. Simulation results suggest that the Bayesian estimate is more accurate than the maximum likelihood estimate of the birth and death rate. The Bayesian approach was applied to a real dataset of 3517 gene families across genomes of five yeast species. The results indicate that the Bayesian model assuming a constant birth and death rate among branches of the phylogenetic tree cannot adequately explain the observed pattern of the sizes of gene families across species. The yeast dataset was thus analyzed with a Bayesian heterogeneous rate model that allows the birth and death rate to vary among the branches of the tree. The unlikely gene families identified by the Bayesian heterogeneous rate model are different from those given by the maximum likelihood method. CONCLUSIONS: Compared to the maximum likelihood method, the Bayesian approach can produce more accurate estimates of the parameters in the birth and death model. In addition, the Bayesian hypothesis test is able to identify unlikely gene families based on Bayesian posterior p-values. As a powerful statistical technique, the Bayesian approach can effectively extract information from gene family data and thereby provide useful information regarding the evolutionary process of gene families across genomes. BioMed Central 2011-11-01 /pmc/articles/PMC3774087/ /pubmed/22044581 http://dx.doi.org/10.1186/1471-2105-12-426 Text en Copyright ©2011 Liu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Liu, Liang
Yu, Lili
Kalavacharla, Venugopal
Liu, Zhanji
A Bayesian model for gene family evolution
title A Bayesian model for gene family evolution
title_full A Bayesian model for gene family evolution
title_fullStr A Bayesian model for gene family evolution
title_full_unstemmed A Bayesian model for gene family evolution
title_short A Bayesian model for gene family evolution
title_sort bayesian model for gene family evolution
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3774087/
https://www.ncbi.nlm.nih.gov/pubmed/22044581
http://dx.doi.org/10.1186/1471-2105-12-426
work_keys_str_mv AT liuliang abayesianmodelforgenefamilyevolution
AT yulili abayesianmodelforgenefamilyevolution
AT kalavacharlavenugopal abayesianmodelforgenefamilyevolution
AT liuzhanji abayesianmodelforgenefamilyevolution
AT liuliang bayesianmodelforgenefamilyevolution
AT yulili bayesianmodelforgenefamilyevolution
AT kalavacharlavenugopal bayesianmodelforgenefamilyevolution
AT liuzhanji bayesianmodelforgenefamilyevolution