Cargando…

Phylogenetic identification of lateral genetic transfer events

BACKGROUND: Lateral genetic transfer can lead to disagreements among phylogenetic trees comprising sequences from the same set of taxa. Where topological discordance is thought to have arisen through genetic transfer events, tree comparisons can be used to identify the lineages that may have shared...

Descripción completa

Detalles Bibliográficos
Autores principales: Beiko, Robert G, Hamilton, Nicholas
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1431587/
https://www.ncbi.nlm.nih.gov/pubmed/16472400
http://dx.doi.org/10.1186/1471-2148-6-15
_version_ 1782127211132747776
author Beiko, Robert G
Hamilton, Nicholas
author_facet Beiko, Robert G
Hamilton, Nicholas
author_sort Beiko, Robert G
collection PubMed
description BACKGROUND: Lateral genetic transfer can lead to disagreements among phylogenetic trees comprising sequences from the same set of taxa. Where topological discordance is thought to have arisen through genetic transfer events, tree comparisons can be used to identify the lineages that may have shared genetic information. An 'edit path' of one or more transfer events can be represented with a series of subtree prune and regraft (SPR) operations, but finding the optimal such set of operations is NP-hard for comparisons between rooted trees, and may be so for unrooted trees as well. RESULTS: Efficient Evaluation of Edit Paths (EEEP) is a new tree comparison algorithm that uses evolutionarily reasonable constraints to identify and eliminate many unproductive search avenues, reducing the time required to solve many edit path problems. The performance of EEEP compares favourably to that of other algorithms when applied to strictly bifurcating trees with specified numbers of SPR operations. We also used EEEP to recover edit paths from over 19 000 unrooted, incompletely resolved protein trees containing up to 144 taxa as part of a large phylogenomic study. While inferred protein trees were far more similar to a reference supertree than random trees were to each other, the phylogenetic distance spanned by random versus inferred transfer events was similar, suggesting that real transfer events occur most frequently between closely related organisms, but can span large phylogenetic distances as well. While most of the protein trees examined here were very similar to the reference supertree, requiring zero or one edit operations for reconciliation, some trees implied up to 40 transfer events within a single orthologous set of proteins. CONCLUSION: Since sequence trees typically have no implied root and may contain unresolved or multifurcating nodes, the strategy implemented in EEEP is the most appropriate for phylogenomic analyses. The high degree of consistency among inferred protein trees shows that vertical inheritance is the dominant pattern of evolution, at least for the set of organisms considered here. However, the edit paths inferred using EEEP suggest an important role for genetic transfer in the evolution of microbial genomes as well.
format Text
id pubmed-1431587
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-14315872006-04-07 Phylogenetic identification of lateral genetic transfer events Beiko, Robert G Hamilton, Nicholas BMC Evol Biol Methodology Article BACKGROUND: Lateral genetic transfer can lead to disagreements among phylogenetic trees comprising sequences from the same set of taxa. Where topological discordance is thought to have arisen through genetic transfer events, tree comparisons can be used to identify the lineages that may have shared genetic information. An 'edit path' of one or more transfer events can be represented with a series of subtree prune and regraft (SPR) operations, but finding the optimal such set of operations is NP-hard for comparisons between rooted trees, and may be so for unrooted trees as well. RESULTS: Efficient Evaluation of Edit Paths (EEEP) is a new tree comparison algorithm that uses evolutionarily reasonable constraints to identify and eliminate many unproductive search avenues, reducing the time required to solve many edit path problems. The performance of EEEP compares favourably to that of other algorithms when applied to strictly bifurcating trees with specified numbers of SPR operations. We also used EEEP to recover edit paths from over 19 000 unrooted, incompletely resolved protein trees containing up to 144 taxa as part of a large phylogenomic study. While inferred protein trees were far more similar to a reference supertree than random trees were to each other, the phylogenetic distance spanned by random versus inferred transfer events was similar, suggesting that real transfer events occur most frequently between closely related organisms, but can span large phylogenetic distances as well. While most of the protein trees examined here were very similar to the reference supertree, requiring zero or one edit operations for reconciliation, some trees implied up to 40 transfer events within a single orthologous set of proteins. CONCLUSION: Since sequence trees typically have no implied root and may contain unresolved or multifurcating nodes, the strategy implemented in EEEP is the most appropriate for phylogenomic analyses. The high degree of consistency among inferred protein trees shows that vertical inheritance is the dominant pattern of evolution, at least for the set of organisms considered here. However, the edit paths inferred using EEEP suggest an important role for genetic transfer in the evolution of microbial genomes as well. BioMed Central 2006-02-11 /pmc/articles/PMC1431587/ /pubmed/16472400 http://dx.doi.org/10.1186/1471-2148-6-15 Text en Copyright © 2006 Beiko and Hamilton; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Beiko, Robert G
Hamilton, Nicholas
Phylogenetic identification of lateral genetic transfer events
title Phylogenetic identification of lateral genetic transfer events
title_full Phylogenetic identification of lateral genetic transfer events
title_fullStr Phylogenetic identification of lateral genetic transfer events
title_full_unstemmed Phylogenetic identification of lateral genetic transfer events
title_short Phylogenetic identification of lateral genetic transfer events
title_sort phylogenetic identification of lateral genetic transfer events
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1431587/
https://www.ncbi.nlm.nih.gov/pubmed/16472400
http://dx.doi.org/10.1186/1471-2148-6-15
work_keys_str_mv AT beikorobertg phylogeneticidentificationoflateralgenetictransferevents
AT hamiltonnicholas phylogeneticidentificationoflateralgenetictransferevents