Cargando…

Models of gene gain and gene loss for probabilistic reconstruction of gene content in the last universal common ancestor of life

BACKGROUND: The problem of probabilistic inference of gene content in the last common ancestor of several extant species with completely sequenced genomes is: for each gene that is conserved in all or some of the genomes, assign the probability that its ancestral gene was present in the genome of th...

Descripción completa

Detalles Bibliográficos
Autores principales: Kannan, Lavanya, Li, Hua, Rubinstein, Boris, Mushegian, Arcady
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3892064/
https://www.ncbi.nlm.nih.gov/pubmed/24354654
http://dx.doi.org/10.1186/1745-6150-8-32
_version_ 1782299456942637056
author Kannan, Lavanya
Li, Hua
Rubinstein, Boris
Mushegian, Arcady
author_facet Kannan, Lavanya
Li, Hua
Rubinstein, Boris
Mushegian, Arcady
author_sort Kannan, Lavanya
collection PubMed
description BACKGROUND: The problem of probabilistic inference of gene content in the last common ancestor of several extant species with completely sequenced genomes is: for each gene that is conserved in all or some of the genomes, assign the probability that its ancestral gene was present in the genome of their last common ancestor. RESULTS: We have developed a family of models of gene gain and gene loss in evolution, and applied the maximum-likelihood approach that uses phylogenetic tree of prokaryotes and the record of orthologous relationships between their genes to infer the gene content of LUCA, the Last Universal Common Ancestor of all currently living cellular organisms. The crucial parameter, the ratio of gene losses and gene gains, was estimated from the data and was higher in models that take account of the number of in-paralogs in genomes than in models that treat gene presences and absences as a binary trait. CONCLUSION: While the numbers of genes that are placed confidently into LUCA are similar in the ML methods and in previously published methods that use various parsimony-based approaches, the identities of genes themselves are different. Most of the models of either kind treat the genes found in many existing genomes in a similar way, assigning to them high probabilities of being ancestral (“high ancestrality”). The ML models are more likely than others to assign high ancestrality to the genes that are relatively rare in the present-day genomes. REVIEWERS: This article was reviewed by Martijn A Huynen, Toni Gabaldón and Fyodor Kondrashov.
format Online
Article
Text
id pubmed-3892064
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38920642014-01-15 Models of gene gain and gene loss for probabilistic reconstruction of gene content in the last universal common ancestor of life Kannan, Lavanya Li, Hua Rubinstein, Boris Mushegian, Arcady Biol Direct Research BACKGROUND: The problem of probabilistic inference of gene content in the last common ancestor of several extant species with completely sequenced genomes is: for each gene that is conserved in all or some of the genomes, assign the probability that its ancestral gene was present in the genome of their last common ancestor. RESULTS: We have developed a family of models of gene gain and gene loss in evolution, and applied the maximum-likelihood approach that uses phylogenetic tree of prokaryotes and the record of orthologous relationships between their genes to infer the gene content of LUCA, the Last Universal Common Ancestor of all currently living cellular organisms. The crucial parameter, the ratio of gene losses and gene gains, was estimated from the data and was higher in models that take account of the number of in-paralogs in genomes than in models that treat gene presences and absences as a binary trait. CONCLUSION: While the numbers of genes that are placed confidently into LUCA are similar in the ML methods and in previously published methods that use various parsimony-based approaches, the identities of genes themselves are different. Most of the models of either kind treat the genes found in many existing genomes in a similar way, assigning to them high probabilities of being ancestral (“high ancestrality”). The ML models are more likely than others to assign high ancestrality to the genes that are relatively rare in the present-day genomes. REVIEWERS: This article was reviewed by Martijn A Huynen, Toni Gabaldón and Fyodor Kondrashov. BioMed Central 2013-12-19 /pmc/articles/PMC3892064/ /pubmed/24354654 http://dx.doi.org/10.1186/1745-6150-8-32 Text en Copyright © 2013 Kannan et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Kannan, Lavanya
Li, Hua
Rubinstein, Boris
Mushegian, Arcady
Models of gene gain and gene loss for probabilistic reconstruction of gene content in the last universal common ancestor of life
title Models of gene gain and gene loss for probabilistic reconstruction of gene content in the last universal common ancestor of life
title_full Models of gene gain and gene loss for probabilistic reconstruction of gene content in the last universal common ancestor of life
title_fullStr Models of gene gain and gene loss for probabilistic reconstruction of gene content in the last universal common ancestor of life
title_full_unstemmed Models of gene gain and gene loss for probabilistic reconstruction of gene content in the last universal common ancestor of life
title_short Models of gene gain and gene loss for probabilistic reconstruction of gene content in the last universal common ancestor of life
title_sort models of gene gain and gene loss for probabilistic reconstruction of gene content in the last universal common ancestor of life
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3892064/
https://www.ncbi.nlm.nih.gov/pubmed/24354654
http://dx.doi.org/10.1186/1745-6150-8-32
work_keys_str_mv AT kannanlavanya modelsofgenegainandgenelossforprobabilisticreconstructionofgenecontentinthelastuniversalcommonancestoroflife
AT lihua modelsofgenegainandgenelossforprobabilisticreconstructionofgenecontentinthelastuniversalcommonancestoroflife
AT rubinsteinboris modelsofgenegainandgenelossforprobabilisticreconstructionofgenecontentinthelastuniversalcommonancestoroflife
AT mushegianarcady modelsofgenegainandgenelossforprobabilisticreconstructionofgenecontentinthelastuniversalcommonancestoroflife