Cargando…

Models of gene gain and gene loss for probabilistic reconstruction of gene content in the last universal common ancestor of life

BACKGROUND: The problem of probabilistic inference of gene content in the last common ancestor of several extant species with completely sequenced genomes is: for each gene that is conserved in all or some of the genomes, assign the probability that its ancestral gene was present in the genome of th...

Descripción completa

Detalles Bibliográficos
Autores principales: Kannan, Lavanya, Li, Hua, Rubinstein, Boris, Mushegian, Arcady
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3892064/
https://www.ncbi.nlm.nih.gov/pubmed/24354654
http://dx.doi.org/10.1186/1745-6150-8-32
Descripción
Sumario:BACKGROUND: The problem of probabilistic inference of gene content in the last common ancestor of several extant species with completely sequenced genomes is: for each gene that is conserved in all or some of the genomes, assign the probability that its ancestral gene was present in the genome of their last common ancestor. RESULTS: We have developed a family of models of gene gain and gene loss in evolution, and applied the maximum-likelihood approach that uses phylogenetic tree of prokaryotes and the record of orthologous relationships between their genes to infer the gene content of LUCA, the Last Universal Common Ancestor of all currently living cellular organisms. The crucial parameter, the ratio of gene losses and gene gains, was estimated from the data and was higher in models that take account of the number of in-paralogs in genomes than in models that treat gene presences and absences as a binary trait. CONCLUSION: While the numbers of genes that are placed confidently into LUCA are similar in the ML methods and in previously published methods that use various parsimony-based approaches, the identities of genes themselves are different. Most of the models of either kind treat the genes found in many existing genomes in a similar way, assigning to them high probabilities of being ancestral (“high ancestrality”). The ML models are more likely than others to assign high ancestrality to the genes that are relatively rare in the present-day genomes. REVIEWERS: This article was reviewed by Martijn A Huynen, Toni Gabaldón and Fyodor Kondrashov.